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polypeptide sequences are: SEQ ID 19 [HERV-K(C7)]; SEQ ID 20 [HERV-K10]; SEQ ID 21 
['ERVK6']; SEQ ID 73. 

HML-2 env polypeptide is encoded by the fourth long ORE in a complete HML-2 genome. 
The translated polypeptide is proteolytically cleaved. Examples of env nucleotide sequences are: 
' SEQ ID 22 [HERV-K(108)]; SEQ ID 23 [HERV-K(C7)] ; SEQ ID 24 [HERV-K(II)] ; SEQ ID 25 
[HERV-K10]. Examples of env polypeptide sequences are: SEQ ED 26 [HERV-K(C7)]; SEQ ID 
27 [HERV-K10] ; SEQ ID 28 ['ERVK6']. 

HML-2 cORF polypeptide is encoded by an ORF which shares the same 5' region and start 
codon as env. After around 87 codons, a splicing event removes env-coding sequences and the 
cORF-coding sequence continues in the reading frame +1 relative to that of env [19, 20]. cORF 
has also been called Rec [21]. Examples of cORF nucleotide sequences are: SEQ IDs 29 & 30 
[HERV-K(108)]. An example of a cORF polypeptide sequence is SEQ ID 31. 

The HML-2 polypeptide may alternatively be from a PCAP open-reading frame [22], such 
as PCAP1, PCAP2, PCAP3, PCAP4, PCAP4a or PCAP5 (SEQ IDs 32 to 37 herein). PCAP3 
1 5 (SEQ IDs 34 & 46) and PCAP5 are preferred (SEQ ID 37). 

The HML-2 polypeptide may alternatively be one of SEQ IDs 38 to 50 [22]. 

Sequences encoding any HML-2 polypeptide expression product may be used in 
accordance with the invention (e.g. sequences encoding any one of SEQ IDs 5, 6, 7, 8, 9, 13, 14, 
19, 20, 21, 26, 27, 28, 31-50, 69-74, 78 or 79). 

The invention may also utilize sequences encoding polypeptides having at least a% 
identity to such wild-type HML-2 polypeptide sequences. The value of a may be 65 or more (e.g. 
66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76„77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 
92, 93, 94, 95, 96, 97, 98, 99, 99.5, 99.9). These sequences include allelic variants, SNP variants, 
homologs, orthologs, paralogs, mutants etc. of the SEQ IDs listed in the previous paragraph. 

The invention may also utilize sequences having at least b% identity to wild-type HML-2 
nucleotide sequences. The value of b may be 65 or more (e.g. 66, 67, 68, 69, 70, 71, 72, 73, 74, 
75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 
99.5, 99.9). These sequences include allelic variants, SNP variants, homologs, orthologs, 
paralogs, mutants etc. of SEQ IDs 1, 2, 3, 4, 10, 11, 12, 15, 16, 17, 18, 22, 23, 24, 25, 29 and 30. ' 

The invention may also utilize sequences comprising a fragment of at least c nucleotides of 
such wild-type HML-2 nucleotide sequences. The value of c may be 7 or more (e.g. 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 60, 70, 75, 80, 90, 100,' 
125, 150, 175, 200, 250, 300 or more). The fragment is preferably a proteolytic cleavage product 
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of a HML-2 polyproteiri. The fragment preferably comprises a sequence encoding a T-cell or, 
preferably, a B-cell epitope from HML-2. T- and B-cell epitopes can be identified empirically 
(e.g. using the PEPSCAN method [23, 24] or similar methods), or they can be predicted e.g. 
using the Jameson- Wolf antigenic index [25], matrix-based approaches [26], TEPITOPE [27], 
5 neural networks [28], OptiMer & EpiMer [29, 30], ADEPT [31], Tsites [32], hydrophilicity [33], 
antigenic index [34] or the methods disclosed in reference 35 etc. 

The invention may also utilize sequences encoding a polypeptide which comprises a 
fragment of at least d amino acids of wild-type HML-2 polypeptide sequences. The value of d 
may be 7 or more (e.g. 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 
10 40, 45, 50, 60, 70, 75, 80, 90, 100, 125, 150, 175, 200, 250, 300 or more). The fragment 
preferably comprises a T-cell or, preferably, a B-cell epitope from HML-2. 

The invention may also utilize sequences comprising (i) a first sequence which is a 
wild-type HML-2 sequence or a sequence as disclosed above and (ii) a second non-HML-2 
sequence. Examples of (ii) include sequences encoding: signal peptides, protease cleavage sites, 
15 epitopes, leader sequences, tags, fusion partners, N-terminal methionine, arbitrary sequences etc. 
Sequence (ii) \vill generally be located at the N- and/or C-terminus of (i). 

Even though a nucleotide sequence may encode a HML-2 polypeptide which is found 
naturally, it may differ from the corresponding natural nucleotide sequence. For example, the 
nucleotide sequence may include mutations e.g. to take into account codon preference in a host 
20 of interest, or to add restriction sites or tag sequences. 

THE SELECTABLE MARKER 

Vectors of the invention include a selectable marker. 

The marker preferably functions in a microbial host (e.g. in a prokaryote, in a bacteria, in a 
yeast). The marker is preferably a prokaryotic selectable marker (e.g. transcribed under the 
25 control of a prokaryotic promoter). 

For convenience, typical markers are antibiotic resistance genes. 

FURTHER FEATURES OF NUCLEIC ACW VECTORS OF THE INVENTION 

The vector of the invention is preferably an autonomously replicating episomal or 
extrachromosomal vector, such as a plasmid. 
30 The vector of the invention preferably comprises an origin of replication. It is preferred 

that the origin of replication is active in prokaryotes but not in eukaryotes. 

Preferred vectors thus include a prokaryotic marker for selection of the vector, a 
prokaryotic origin of replication, but a eukaiyotic promoter for driving transcription of the 
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HML-2 coding sequence. The vectors will therefore (a) be amplified and selected in prokaryotic 
hosts without HML-2 polypeptide expression, but (b) be expressed in eukaryotic hosts without 
being amplified. This is ideal for nucleic acid immunization vectors. 

The vector of the invention may comprise a eukaryotic transcriptional terminator sequence 
5 downstream of the HML2-coding sequence. This can enhance transcription levels. Where the 
HML2-coding sequence does not have its own, the vector of the invention preferably comprises 
a polyadenylation sequence. A preferred polyadenylation sequence is from bovine growth 
hormone. 

The vector of the invention may comprise a multiple cloning site 
10 In addition to sequences encoding a HML-2 polypeptide and a marker, the vector may 

comprise a second eukaryotic coding sequence. The vector may also comprise an IRES upstream 
of said second sequence in order to permit translation of a second eukaryotic polypeptide from 
the same transcript as the HML-2 polypeptide. Alternatively, the HML-2 polypeptide may be 
downstream of an IRES. 

. 15 The vector of the invention may comprise unmethylated CpG motifs e.g. unmethylated 

DNA sequences which have in common a cytosine preceding a guanosine, flanked by two 5 1 
purines and two 3* pyrimidines. In their unmethylated form these DNA motifs have been 
demonstrated to be potent stimulators of several types of immune cell. 

PHARMACEUTICAL COMPOSITIONS 

20 The invention provides a pharmaceutical composition comprising a vector of the invention. 

The invention also provides the vectors' use as medicaments, and their use in the manufacture of 
medicaments for treating prostate cancer. The invention also provides a method for treating a 
patient with a prostate tumor, comprising administering to them a pharmaceutical composition of 
the invention. The patient is generally a human, preferably a human male, and more preferably 

25 an adult human male. Other diseases in which HERV-Ks have been implicated include testicular 
cancer [36], multiple sclerosis [37], and insulin-dependent diabetes mellitus (IDDM) [38], and 
the vectors may also be used against these diseases. 

The invention also provides a method for raising an immune response, comprising 
administering an immunogenic dose of a vector of the invention to an animal (e.g. to a human). 
30 Pharmaceutical compositions encompassed by the present invention include as active 

agent, the vectors of the invention in a therapeutically effective amount. An "effective amount" 
is an amount sufficient to effect beneficial or desired results, including clinical results. An 
effective amount can be administered in one or more administrations. For purposes of this 
invention, an effective amount is an amount that is sufficient to palliate, ameliorate, stabilize, 

6 
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reverse, slow or delay the symptoms and/or progression of prostate cancer. The effect can be 
detected by, for example, chemical markers or antigen levels. Therapeutic effects also include 
reduction in physical symptoms. 

The precise effective amount for a subject will depend upon the subject's size and health, 
5 the nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. The effective amount for a given situation is determined by routine 
experimentation and is within the judgment of the clinician. For purposes of the present 
invention, an effective dose will generally be from about 0.01mg/kg to about 5 mg/kg, or about 
0.01 mg/ kg to about 50 mg/kg or about 0.05 mg/kg to about 10 mg/kg of the compositions of the 
10 present invention in the individual to which it is administered. 

The compositions can be used to treat cancer as well as metastases of primary cancer. In 
addition, the pharmaceutical compositions can be used in conjunction with conventional methods 
of cancer treatment, e.g. to sensitize tumors to radiation or conventional chemotherapy. The 
terms "treatment", "treating", "treat" and the like are used herein to generally refer to obtaining a 

15 desired pharmacologic and/or physiologic effect. The effect may be prophylactic in terms of 
completely or partially preventing a disease or symptom thereof and/or may be therapeutic in 
terms of a partial or complete stabilization or cure for a disease and/or adverse effect attributable 
to the disease. "Treatment" as used herein covers any treatment of a disease in a mammal, 
particularly a human, and includes: (a) preventing the disease or symptom from occurring in a 

20 subject which may be predisposed to the disease or symptom but has not yet been diagnosed as 
having it; (b) inhibiting the disease symptom, i.e. arresting its development; or (c) relieving the 
disease symptom, i.e. causing regression of the disease or symptom. 

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. The 
term pharmaceutically acceptable carrier" refers to a carrier for administration of a therapeutic 

25 agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. The term refers to 
any pharmaceutical carrier that does not itself induce the production of antibodies harmful to the 
individual receiving the composition, and which can be administered without undue toxicity. 
Suitable carriers can be large, slowly metabolized macromolecules such as proteins, 
polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid 

30 copolymers, and inactive virus particles. Such carriers are well known to those of ordinary skill 
in the art. Pharmaceutically acceptable carriers in therapeutic compositions can include liquids 
such as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or emulsifying 
agents, pH buffering substances, and the like, can also be present in such vehicles. Typically, the 
therapeutic compositions are prepared as injectables, either as liquid solutions or suspensions; 

35 solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection can also be 
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prepared. Liposomes are included within the definition of a pharmaceutical!)^ acceptable carrier. 
Pharmaceutical^ acceptable salts can also be present in the pharmaceutical composition, e.g. 
mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and 
the salts of organic acids such as acetates, propionates, malonates, benzoates, and the like. A 
5 thorough discussion of pharmaceutically acceptable excipients is available in reference 39. 

The composition is preferably sterile and/or pyrogen-free. It will typically be buffered at 
about pH 7. 

Once formulated, the compositions contemplated by the invention can be.(l) administered 
directly to the subject; or (2) delivered ex vivo, to cells derived from the subject (e.g. as in ex 
10 vivo gene therapy). Direct delivery of the compositions will generally be accomplished by 
parenteral injection, e.g. subcutaneously, intraperitoneal^, intravenously or intramuscularly, 
. intratumoral or to the interstitial space of a tissue. Other modes of administration include oral 
and pulmonary administration, suppositories, and transdermal applications, needles, and gene 
guns or hyposprays. Dosage treatment can be a single dose schedule or a multiple dose schedule. 

4 15 Intramuscular injection is preferred. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject are 
known in the art [e.g. ref. 40]. Examples of cells useful in ex vivo applications include, for 
example, stem cells, particularly hematopoetic, lymph cells, macrophages, dendritic cells, or 
tumor cells. Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
20 accomplished by, for example, dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of the nucleic 
acid(s) in liposomes, and direct microinjection of the DNA into nuclei, all well known in the art. 

Targeted delivery 

Vectors of the invention may be delivered in a targeted way. 

25 Receptor-mediated DNA delivery techniques are described in, for example, references 41 

to 46. Therapeutic compositions containing a nucleic acid are administered in a range of about 
lOOng to about 200mg of DNA for local administration in a gene therapy protocol. 
Concentration ranges of about 500 ng to about 50 mg, about 1 ng to about 2 mg, about 5\ig to 
about 500jig, and about 20^g to about lOOjag of DNA can also be used during a gene therapy 

30 protocol. Factors such as method of action (e.g. for enhancing or inhibiting levels of the encoded 
gene product) and efficacy of transformation and expression are considerations which will affect 
the dosage required for ultimate efficacy. Where greater expression is desired over a larger area 
of tissue, larger amounts of vector or the same amounts re-administered in a successive protocol 
of administrations, or several administrations to different adjacent or close tissue portions of e.g. 

8 
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a tumor site, may be required to effect a positive therapeutic outcome. In all cases, routine 
experimentation in clinical trials will determine specific ranges for optimal therapeutic effect. 

Vectors can be delivered using gene delivery vehicles. The gene delivery vehicle can be of 
viral or non-viral origin (see generally references 47 to 50). 

5 Viral-based vectors for delivery of a desired nucleic acid and expression in a desired cell 

are well known in the art. Exemplary viral-based vehicles include, but are not limited to, 
recombinant retroviruses (e.g. references 51 to 61), alphavirus-based vectors (e.g. Sindbis virus 
vectors, Semliki forest virus (ATCC VR-67; ATCC VR-1247), Ross River virus (ATCC VR- 
373; ATCC VR-1246) and Venezuelan equine encephalitis virus (ATCC VR-923; ATCC VR- 
10 1 250; ATCC VR 1249; ATCC VR-532); hybrids or chimeras of these viruses may also be used), 
poxvirus vectors (e.g. vaccinia, fowlpox, canarypox, modified vaccinia Ankara, etc.\ adenovirus 
vectors, and adeno-associated virus (AAV) vectors (e.g. see refs. .62 to 67). Administration of 
DNA linked to killed adenovirus [68] can also be employed. 

Non-viral delivery vehicles and methods can also be employed, including, but not limited 
15 to, polycationic condensed DNA linked or unlinked to killed adenovirus alone [e.g. 68], ligand- 
linked DNA [69], eukaryotic cell delivery vehicles cells [e.g. refs. 70 to 74] and nucleic charge 
neutralization or fusion with cell membranes. Naked DNA can also be employed. Exemplary 
naked DNA introduction methods are described in refs. 75 and 76. Liposomes (e.g. 
immunoliposomes) that can act as gene delivery vehicles are described in refs. 77 to 81. 
20 Additional approaches are described in refs. 82 & 83. 

Further non-viral delivery suitable for use includes mechanical delivery systems such as 
the approach described in ref. 83. Moreover, the coding sequence and the product of expression 
of such can be delivered through deposition of photopolymerized hydrogel materials or use of 
ionizing radiation [e.g. refs. 84 & 85]. Other conventional methods for gene delivery that can be 
25 used for delivery of the coding sequence include, for example, use of hand-held gene transfer 
particle gun [86] or use of ionizing radiation for activating transferred genes [84 & 87]. 

Delivery DNA using PLG (poly(lactide-co-glycolide)} microparticles is a particularly 
preferred method e.g. by adsorption to the microparticles, which are optionally treated to have a 
negatively-charged surface (e.g. treated with SDS) or a positively-charged surface (e.g. treated 
30 with a cationic detergent, such as CTAB). 

Vaccine compositions 

The pharmaceutical composition is preferably an immunogenic composition and is more 
preferably a vaccine composition. Such compositions can be used to raise antibodies in a 
mammal (e.g. a human) and/or to raise a cellular immune response (e.g. a response involving 



Copy provided by USPTO from the PACR Image Database on 07/22/2003 



604-72189 .055003 




PATENT 
PP-19482.002 

T-cells such as CTLs, a response involving natural killer cells, a response involving 
macrophages etc.) 

The invention provides the use of a vector of the invention in the manufacture of 
medicaments for preventing prostate cancer. The invention also provides a method for protecting 
5 a patient from prostate cancer, comprising administering to them a pharmaceutical composition 
of the invention. 

Nucleic acid immunization is well known [e.g. refs. 88 to 94 etc.] 

The composition may additionally comprise an adjuvant. For example, the composition 
may comprise one or more of the following adjuvants: (1) oil-in- water emulsion formulations 

10 (with or without other specific immunostimulating agents such as muramyl peptides (see below) 
or bacterial cell wall components), such as for example (a) MF59™ [95; Chapter 10 in ref. 96], 
containing 5% Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing MTP-PE) 
formulated into submicron particles using a microfluidizer, (b) SAF, containing 10% Squalane, 
0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP either microfluidized into a 

15 submicron emulsion or vortexed to generate a larger particle size emulsion, and (c) Ribi™ 
adjuvant system (RAS), (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 0.2% 
Tween 80, and one or more bacterial cell wall components from the group consisting of 
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (DetoxTM); (2) saponin adjuvants^ such as QS21 or StimulonTM 

20 (Cambridge Bioscience, Worcester, MA) may be used or particles generated therefrom such as 
ISCOMs (immunostimulating complexes), which ISCOMS may be devoid of additional 
detergent [97]; (3) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant 
(IFA); (4) cytokines, such as interleukins (e.g. ILrl, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12 etc.), 
interferons (e.g. gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 

25 necrosis factor (TNF), etc.; (5) monophosphoryl lipid A (MPL) or 3-O-deacylated MPL 
(3dMPL) [e.g. 98, 99]; (6) combinations of 3dMPL with, for example, QS21 and/or oil-in-water 
emulsions [e.g. 100, 101, 102]; (7) oligonucleotides comprising CpG motifs i.e. containing at 
least one CG dinucleotide, with 5-methylcytosine optionally being used in place of cytosine; (8) 
a polyoxyethylene ether or a polyoxyethylene ester [103]; (9) a polyoxyethylene sorbitan ester 

30 surfactant in combination with an octoxynol [104] or a polyoxyethylene alkyl ether or ester 
surfactant in combination with at least one additional non-ionic surfactant such as an octoxynol 
[105]; (10) an immunostimulatory oligonucleotide (e.g. a CpG oligonucleotide) and a saponin 
[106]; (11) an immunostimulant and a particle of metal salt [107]; (12) a saponin and an oil-in- 
water emulsion [108]; (13) a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) [109]; 

35 (14) aluminium salts, preferably hydroxide or phosphate, but any other suitable salt may also be 

10 
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used (e.g. hydroxyphosphate, oxyhydroxide, orthophosphate, sulphate etc. [chapters 8 & 9 of ref. 
96]). Mixtures of different aluminium salts may also be used. The salt may take any suitable 
form (e.g. gel, crystalline, amorphous etc.); (15) chitosan; (16) cholera toxin or E.coli heat labile 
toxin, or detoxified mutants thereof [110]; (17) microparticles (Le. a particle of ~100nin to 
5 —150pm in diameter, more preferably ~200nm to ~30pm in diameter, and most preferably 
~500nm to ~10jim in diameter) formed from materials that are biodegradable and non-toxic (e.g. 
a poly(a-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride, a 
polycaprolactone etc., such as poiy(lactide-co-glycolide) etc.) optionally treated to have a 
negatively-charged surface (e.g. with SDS) or a positively-charged surface (e.g. with a cationic 

10 detergent, such as CTAB); (18) monophosphoryl lipid A mimics, such as aminoalkyl 
glucosaminide phosphate derivatives e.g. RC-529 [111]; (19) polyphosphazene (PCPP); (20) a 
bioadhesive [112] such as esterified hyaluronic acid microspheres [113] or a mucoadhesive 
selected from the group consisting of cross-linked derivatives of poly(acrylic acid), polyvinyl 
alcohol, polyvinyl pyrollidone, polysaccharides and carboxymethylcellulose; (21) double- 

15 stranded RNA; or (22) other substances that act as immunostimulating agents to enhance the 
efficacy of the composition. Aluminium salts and/or MF59™ are preferred. 

Vaccines of the invention may be prophylactic (i.e. to prevent disease) or therapeutic (i.e. 
to reduce or eliminate the symptoms of a disease). 

SPECIFIC VECTORS OF THE INVENTION 
20 Preferred vectors of the invention comprise: (i) a eukaryotic promoter, (ii) a sequence 

encoding a HML-2 polypeptide downstream of and operably linked to said promoter; (iii) a 
prokaryotic selectable marker; (iv) a prokaryotic origin of replication; and (v) a eukaryotic 
transcription terminator downstream of and operably linked to said sequence encoding a HML-2 
polypeptide. 

25 Particularly preferred vectors are shown in figures 2 to 8 (SEQ IDs 51 to 56 & 80). 

VIRUS-LIKE PARTICLES 

HML-2 gag polypeptide has been found to assemble into virus-like particles (VLPs). This 
particulate form of the polypeptide has enhanced immunogenicity when compared to soluble 
polypeptide and is a preferred form of polypeptide for use in immunization and/or diagnosis. 
30 Thus the invention provides a virus-like particle, comprising HML-2 gag polypeptide. The 

gag polypeptide may be myristoylated at its N-terminus. 

The invention also provides a VLP of the invention for use as an immunogen or for use as 
a diagnostic antigen. The invention also provides the use of a VLP of the invention in the 
manufacture of a medicament for immunizing an animal. 

11. 
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The invention also provides a method of raising an immune response in an animal, 
comprising administering to the animal a VLP of the invention. The immune response may 
comprise a humoral immune response and/or a cellular immune response. 

For raising an immune response, the VLP may be administered with or without an adjuvant 
as disclosed above. The immune response may treat or protect against cancer (e.g. prostate 
cancer). 

The invention also provides a method for diagnosing cancer (e.g. prostate cancer) in a 
patient, comprising the step of contacting antibodies from the patient with VLPs of the invention. 
Similarly, the invention provides a method for diagnosing cancer (e.g. prostate cancer) in a 
patient, comprising the step of contacting anti-VLP antibodies with a patient sample. 

The invention also provides a process for preparing VLPs of the invention, comprising the 
step of expressing gag polypeptide in a cell, and collecting VLPs from the cell. Expression may 
be achieved using a vector of the invention. 

The VLP of the invention may or may not include packaged nucleic acid. 

The gag polypeptide from which the VLPs are made can be from any suitable HML-2 
virus (e.g. SEQ IDs 1-9, 69 & 78). 

DEFINITIONS 

The term "comprising" means "including" as well as "consisting" e.g. a composition 
"comprising" X may consist exclusively of X or may include something additional e.g. X + Y. 

The term "about" in relation to a numerical value x means, for example, *±10%. 

The terms "neoplastic cells", "neoplasia", "tumor", "tumor cells", "cancer" and "cancer 
cells" (used interchangeably) refer to cells which exhibit relatively autonomous growth, so that 
, they exhibit an aberrant growth phenotype characterized by a significant loss of control of cell 
proliferation (i.e. de-regulated cell division). Neoplastic cells can be malignant or benign and 
include prostate cancer derived tissue. 

References to a percentage sequence identity between two nucleic acid sequences mean 
that, when aligned, that percentage of bases are the same in comparing the two sequences. This 
alignment and the percent homology or sequence identity can be determined using software 
programs known in the art, for example those described in section 7.7.18 of reference 1 14. A 
preferred alignment program is GCG Gap (Genetics Computer Group, Wisconsin, Suite Version 
10.1), preferably using default parameters, which are as follows: open gap = 3; extend gap = 1 . 

References to a percentage sequence identity between two amino acid sequences means 
that, when aligned, that percentage of amino acids are the same in comparing the two sequences. 

12 




PATENT 
PP-19482.002 

This alignment and the percent homology or sequence identity can be determined using software 
programs known in the art, for example those described in section 7.7.18 of reference 114. A 
preferred alignment is determined by the Smith- Waterman homology search algorithm using an 
affine gap search with a gap open penalty of 12 and a gap extension penalty of 2, BLOSUM 
5 matrix of 62. The Smith- Waterman homology search algorithm is taught in reference 115. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 shows the pCMVkm2 vector, and Figures 2 to 8 show vectors formed by inserting 
sequences encoding HML-2 polypeptides into this vector. 

Figure 9 shows the location of coding sequences in the HML2.HOM genome, with 
1 0 nucleotide numbering according to ref. 5. 

Figure 10 is a western blot showing gag expression in transfected 293 cells. Lanes 1 to 4 
are: (1) gag opt HML-2; (2) gag opt PCAV; (3) gag wt PCAV; (4) mock. 

Figure 11 also shows western blots of transfected 293 cells. In Figure 11 A the staining 
antibody was anti-HML-2, but in Figure 1 IB it was anti-PCAV. In both 1 1 A and 1 IB lanes 1 to 
15 4 are: (1) mock; (2) gag opt HML-2; (3) gag opt PCAV; (4) gag wt PCAV. The upper arrow 
shows the position of gag; the lower arrow shows the B-actin control. 

Figure 12 shows electron microscopy of 293 cells expressing (12A) gag opt PCAV or 
(12B) gag opt HML-2. 

MODES FOR CARRYING OUT THE INVENTION 

20 Certain aspects of the present invention are described in greater detail in the non-limiting 

examples that follow. The examples are put forth so as to provide those of ordinary skill in the 
art with a disclosure and description of how to make and use the present invention, and are not 
intended to limit the scope of what the inventors regard as their invention nor are they intended 
to represent that the experiments below are all and only experiments performed. Efforts have 

25 been made to ensure accuracy with respect to numbers used (e.g. amounts, temperature, etc.) but 
some experimental errors and deviations should be accounted for. Unless indicated otherwise, 
parts are parts by weight, molecular weight is weight average molecular weight, temperature is 
in degrees Celsius, and pressure is at or near atmospheric. 

Vectors for expressing HML-2 polypeptides 
30 The basic pCMVkm2 vector is shown in figure 1. This vector has an immediate-early 

CMV enhancer/promoter and a bovine growth hormone transcription terminator, with a multiple 
cloning site in between. The vector also has a kanamycin resistance gene and a ColEl origin of 
replication. 

13 
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Sequences coding for HML-2 polypeptides being inserted between Sail and EcoRI in the 
multiple cloning site: 



Figure 


SEQ n> 


HML-2 polypeptide 


2 


51 


cORF I 


3 


52 


PCAP5 


4 


53 


gag 


5 


54 


gag 


6 


55 


Pit 


7 


56 


Pol 



10 



15 



20 



25 



30 



35 



40 



45 



Except for the vector shown in figure 4 (SEQ ID 53), the inserted sequences were 
manipulated for codon preference, including addition of an optimal stop codon: 

cORF manipulation: 

Start with SEQ ID 57 (SEQ ID 43); manipulate to SEQ ID 58 (SEQ ID 67): 

ATGAACCCATCAGAGATGCAAAGAAAAGCACCTCCGCGGAGACGGAGACATC cORFwt_hral (1) 
ATGAACCCCAGCGAGATGCAGCGCAAGGCCCCCCCCCGCCGCCGCCGCCACC corf opt_hml ( 1 ) 

GCAATCGAGCACCGTTGACTCACAAGATGAACAAAATGGTGACGTCAGAAGA cORFw t_hml (53) 
GCAACCGCGCCCCCCTGACCCACAAGATGAACAAGATGGTGACCAGCGAGGA corf opt_hml ( 53) 

ACAGATG7VAGTTGCCATCCACCAAGAAGGCAGAGCCGCCAACTTGGGCACAA cORFwt_hml ( 105 ) 
GCAGATGAAGCTGCCCAGCACCAAGAAGGCCGAGCCCCCCACCTGGGCCCAG corf opt_hiul ( 105 ) 

CTAAAGAAGCTGACGCAGTTAGCTACAAAATATCTAGAGAACACAAAGGTGA cORFwt_hml (157) 
CTGAAGAAGCTGACCCAGCTGGCCACCAAGTACCTGGAGAACACCAAGGTGA corf opt_hml (157) 

CACAAACCCCAGAGAGT ATGCTGCTTGCAGCCTTGATGATTGTATCAATGGT cORFw t__hml (209) 
CCCAGACCCCCGAGAGCATGCTGCTGGCCGCCCTGATGATCGTGAGCATGGT cor f opt_hml (209) 

GTCTGCAGGTGTACCCAACAGCTCCGAAGAGACAGCGACCATCGAGAACGGG cORFwt_hml (261) 
GAGCGCCGGCGTGCCCAACAGCAGCGAGGAGACCGCCACCATCGAGAACGGC cor f opt_hml (261) 



CCA TGA 

CCCGCTTAA 



cORFwtJunl (313) 
corf opt_hml (313) 



PCAP5 manipulation: 

Start with SEQ ID 59 (SEQ ID 37); manipulate to SEQ ID 60 (SEQ ID 68): 

ATGAACCCATCGGAGATGCAAAGAAAAGCACCTCCGCGGAGACGGAGACAT pCAP5wt_hml ( 1 ) 
ATGAACCCCAGCGAGATGCAGCGCAAGGCCCCCCCCCGCCGCCGCCGCCAC pcap5opt_hml ( 1 ) 

CGCAATCGAGCACCGTTGACTCACAAGATGAACAAAATGGTGACGTCAGAA pCAP5wt_hml ( 52 ) 
CGCAACCGCGCCCCCCTGACCCACAAGATGAACAAGATGGTGACCAGCGAG pcap5opt_hml ( 52 ) 

GAACAGATGAAGTTGCCATCCACCAAGAAGGCAGAGCCGCCAACTTGGGCA pCAPSwt Jiml (103) 
GAGCAGATGAAGCTGCCCAGCACCAAGAAGGCCGAGCCCCCCACCTGGGCC pcap5opt_hml (103) 

CAACTAAAGAAGCTGACGCAGTTAGCTACAAAATATCTAGAGAACACAAAG pCAP5wt_hml (154) 
CAGCTGAAGAAGCTGACCCAGCTGGCCACCAAGTACCTGGAGAACACCAAG pcap5opt_hml (154) 

GTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCA pCAP5wt_hml (205) 
GTGACCCAGACCCCCGAGAGCATGCTGCTGGCCGCCCTGATGATCGTGAGC pcap5opt_hral { 205 ) 

ATGGTGGTGTACCCAACAGCTCCGAAGAGACAGCGACCATCGAGAACGGGC pCAP5wt_hml (256) 
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ATGGTGGTGTACCCCACCGCCCCCAAGCGCCAGCGCCCCAGCCGCACCGGC pcap5opt_hml (256) 

CATGATGACGATGGCGGTTTTGTCGAAAAGAAAAGGGGGAAATGTGGGGAA pCAP5 wt_hml (307) 
CACGACGACGACGGCGGCTTCGTGGAGAAGAAGCGCGGCAAGTGCGGCGAG pcap5opt_hml (307) 

AAGCAAGAGAGATCAGATTGTTACTGTGTCTGTGTAGAAAGAAGTAGACAT pCAP5wt_hml (358) 
AAGCAGGAGCGCAGCGACTGCTACTGCGTGTGCGTGGAGCGCAGCCGCCAC pcap5opt_hml (358) 

AGGAGACTCCATTTTGTTCTGTAC TAA pCAP5wt_hml (409) 

10 CGCCGCCTGCACTTCGTGCTGTACGCTTAA pcap5opt_hml (409) 

Gag manipulation: 

Start with SEQ ID 61 (SEQ ID 69); manipulate to SEQ ID 62 (SEQ ID 70): 

ATGGGGCAAACTAAAAGTAAAATTAAAAGTAAATATGCCTCTTATCTCAGCT gagwt_hml (1) 
15 ATGGGCCAGACCAAGAGCAAGATCAAGAGCAAGTACGCCAGCTACCTGAGCT gagoptjiml (1) 

TTATTAAAATTCTTTTAAAAAGAGGGGGAGTTAAAGTATCTACAAAAAATCT gagwt_hml (53) 
TCATCAAGATCCTGCTGAAGCGCGGCGGCGTGAAGGTGAGCACCAAGAACCT gagopt_hml (53) 

20 7VATCAAGCTATTTCAAATAATAGAACAATTTTGCCCATGGTTTCCAGAACAA gagwt_hml (105) 

GATCAAGCTGTTCCAGATCATCGAGCAGTTCTGCCCCTGGTTCCCCGAGCAG gagopt_hml (105) 



25 
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GGAACTTTAGATCTAAAAGATTGGAAAAGAATTGGTAAGGAACTAAAACAAG gagwt_Jiml (157) 
GGCACCCTGGACCTGAAGGACTGGAAGCGCATCGGCAAGGAGCTGAAGCAGG gagopt_hml { 157 ) 

CAGGTAGGAAGGGTAATATCATTCCACTTACAGTATGGAATGATTGGGCCAT gagwtjiml (209) 
CCGGCCGCAAGGGCAACATCATCCCCCTGACCGTGTGGAACGACTGGGCCAT gagoptjiml ( 209 ) 

TATTAAAGCAGCTTTAGAACCATTTCAAACAGAAGAAGATAGCGTTTCAGTT gagwt_hml (261 ) 
30 CATCAAGGCCGCCCTGGAGCCCTTCCAGACCGAGGAGGACAGCGTGAGCGTG gagopt_hml (261) 

TCTGATGCCCCTGGAAGCTGTATAATAGATTGTAATGAAAACACAAGGAAAA gagwt_hml (313) 
AGCGACGCCCCCGGCAGCTGCATCATCGACTGCAACGAGAACACCCGCAAGA gagopt hml (313) 

35 AATCCCAGAAAGAAACGGAAGGTTTACATTGCGAATATGTAGCAGAGCCGGT gagwt_hml (365) 

AGAGCCAGAAGGAGACCGAGGGCCTGCACTGCGAGTACGTGGCCGAGCCCGT gagopt_hral (365) 

AATGGCTCAGTCAACGCAAAATGTTGACTATAATCAATTACAGGAGGTGATA gagwt_hml (417) 
GATGGCCCAGAGCACCCAGAACGTGGACTACAACCAGCTGCAGGAGGTGATC gagopt_hml (417) 

TATCCTGAAACGTTAAAATTAGAAGGAAAAGGTCCAGAATTAGTGGGGCCAT gagwt_hml (469) 
TACCCCGAGACCCTGAAGCTGGAGGGCAAGGGCCCCGAGCTGGTGGGCCCCA gagopt_hml (4 69) 

CAGAGTCTAAACCACGAGGCACAAGTCCTCTTCCAGCAGGTCAGGTGCCTGT gagwt_hml ( 521 ) 
45 GCGAGAGCAAGCCCCGCGGCACCAGCCCCCTGCCCGCCGGCCAGGTGCCCGT gagoptjiml (521) 

AACATTACAACCTCAAAAGCAGGTTAAAGAAAATAAGACCCAACCGCCAGTA gagwt_hml (573) 
GACCCTGCAGCCCCAGAAGCAGGTGAAGGAGAACAAGACCCAGCCCCCCGTG gagopt_hml ( 573 ) 

50 GCCTATCAATACTGGCCTCCGGCTGAACTTCAGTATCGGCCACCCCCAGAAA gagwt_hml (625) 

GCCTACCAGTACTGGCCCCCCGCCGAGCTGCAGTACCGCCCCCCCCCCGAGA gagoptjiml (625) 

GTCAGTATGGATATCCAGGAATGCCCCCAGCACCACAGGGCAGGGCGCCATA gagwtjiml ( 677 ) 
GCCAGTACGGCTACCCCGGCATGCCCCCCGCCCCCCAGGGCCGCGCCCCCTA gagoptjiml ( 677 ) 

CCCTCAGCCGCCCACTAGGAGACTTAATCCTACGGCACCACCTAGTAGACAG gagwtjiml (729) 
CCCCCAGCCCCCCACCCGCCGCCTGAACCCCACCGCCCCCCCCAGCCGCCAG gagop t_hml (729) 

GGTAGTAAATTACATGAAATTATTGATAAATCAAGAAAGGAAGGAGATACTG gagwtjiml (781) 
60 GGCAGCAAGCTGCACGAGATCATCGACAAGAGCCGCAAGGAGGGCGACACCG gagopt_hml (781) 

AGGCATGGCAATTCCCAGTAACGTTAGAACCGATGCCACCTGGAGAAGGAGC gagwtjiml ( 833 ) 
AGGCCTGGCAGTTCCCCGTGACCCTGGAGCCCATGCCCCCCGGCGAGGGCGC gagoptjiml ( 833 ) 

65 CCAAGAGGGAGAGCCTCCCACAGTTGAGGCCAGATACAAGTCTTTTTCGATA gagwtjiml (885) 

CCAGGAGGGCGAGCCCCCCACCGTGGAGGCCCGCTACAAGAGCTTCAGCATC gagopt_hml (885) 
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AAAAAGCTAAAAGATATGAAAGAGGGAGTAAAACAGTATGGACCCAACTCCC gagwt_hml ( 937 ) 
AAGAAGCTGAAGGACATGAAGGAGGGCGTGAAGCAGTACGGCCCCAACAGCC gagop t_hml (937) 

CTTATATGAGGACATTATTAGATTCCATTGCTCATGGACATAGACTCATTCC gagwt_hml (98 9) 
5 CCTACATGCGCACCCTGCTGGACAGCATCGCCCACGGCCACCGCCTGATCCC gagop tjhml (989) 

TTATGATTGGGAGATTCTGGCAAAATCGTCTCTCTCACCCTCTCAATTTTTA gagwt_hml (1041) 
CTACGACTGGGAGATCCTGGCCAAGAGCAGCCTGAGCCCCAGCCAGTTCCTG gagopt_hml (1041) 

10 CAATTTAAGACTTGGTGGATTGATGGGGTACAAGAACAGGTCCGAAGAAATA gagwt_hml (1093) 

CAGTTCAAGACCTGGTGGATCGACGGCGTGCAGGAGCAGGTGCGCCGCAACC gagop t_hml ( 1093 ) 

GGGCT GCCAATCCT CCAGTTAAC ATAG ATGCAGATCAACTATTAGGAATAGG gagwt_hml (114 5) 
GCGCCGCCAACCCCCCCGTGAACATCGACGCCGACCAGCTGCTGGGCATCGG gagop t_hml (1145) 

TCAAAATTGGAGTACTATTAGTCAACAAGCATTAATGCAAAATGAGGCCATT gagwt_hml (1197) 
CCAGAACTGGAGCACCATCAGCCAGCAGGCCCTGATGCAGAACGAGGCCATC gagopt_hml (1197) 

GAGCAAGTTAGAGCTATC TGCC TTAGAGCCT GGGAAAAAATCCAAGACCCAG gagwt_hml (124 9) 
20 GAGCAGGTGCGCGCCATCTGCCTGCGCGCCTGGGAGAAGATCCAGGACCCCG gag op t_hml (1249) 

GAAGTACCTGCCCCTCATTTAATACAGTAAGACAAGGTTCAT^AGAGCCCTA gag wt_hml (1301) 
GCAGCACCTGCCCCAGCTTCAACACCGTGCGCCAGGGCAGCAAGGAGCCCTA gagopt__hml (1301) 

25 TCCTGATTTTGTGGCAAGGCTCC/^AGATGTTGCTCAAAAGTCAATTGCTGAT gagwtjhinl (1353) 

CCCCGACTTCGTGGCCCGCCTGCAGGACGTGGCCCAGAAGAGCATCGCCGAC gagoptjiml (1353) 

GAAAAAGCCCGTAAGGTCATAGTGGAGTTGATGGCATATGAAAACGCCAATC gagwt_hml (1405) 
^ GAGAAGGCCCGCAAGGTGATCGTGGAGCTGATGGCCTACGAGAACGCCAACC gagopt_hinl (1405) 

CTGAGTGTCAATCAGCCATTAAGCCATTAAAAGGAAAGGTTCCTGCAGGATC gagwH_hml (14 57) 
•CCGAGTGCCAGAGCGCCATCAAGCCCCTGAAGGGCAAGGTGCCCGCCGGCAG gagopt_hml (1457) 

AGATGTAATCTCAGAATATGTAAAAGCCTGTGATGGAATCGGAGGAGCTATG gagwtjhral (1509) 
35 CGACGTGATCAGCGAGTACGTGAAGGCCTGCGACGGCATCGGCGGCGCCATG . gagop t_hml (1509) 

CATAAAGCTATGCTTATGGCTCAAGCAATAACAGGAGTTGTTTTAGGAGGAC gagwtjiml (1561) 
CACAAGGCCATGCTGATGGCCCAGGCCATCACCGGCGTGGTGCTGGGCGGCC gagop t_hml (1561) 

40 AAGTTAGAACATTTGGAAGAAAATGTTATAATTGTGGTCAAATTGGTCACTT gagwt_hral (1613) 

AGGTGCGCACCTTCGGCCGCAAGTGCTACAACTGCGGCCAGATCGGCCACCT gagoptjiml ( 1613 ) 

AAAAAAGAATTGCCCAGTCTTAAATAAACAGAATATAACTATTCAAGCAACT gagwt_hral (1665) 
^ GAAGAAGAACTGCCCCGTGCTGAACAAGCAGAACATCACCATCCAGGCCACC gagoptjiml (1665) 

ACAACAGGTAGAGAGCCACCTGACTTATGTCCAAGATGTAAAAAAGGAAAAC gagwt Jiral ( 1717 ) 
ACCACCGGCCGCGAGCCCCCCGACCTGTGCCCCCGCTGCAAGAAGGGCAAGC gagoptjiml (1717) 

ATTGGGCTAGTCAATGTCGTTCTAAATTTGAT.AAAAATGGGCAACCATTGTC gagwt Juul (1769) 
50 ACTGGGCCAGCCAGTGCCGCAGCAAGTTCGACAAGAACGGCCAGCCCCTGAG gagoptjiml (1769) 

GGGAAACGAGCAAAGGGGCCAGCCTCAGGCCCCACAACAAACTGGGGCATTC gagwtjiml (1821) 
CGGCAACGAGCAGCGCGGCCAGCCCCAGGCCCCCCAGCAGACCGGCGCCTTC ga gopt Jiml (1821) 

55 CCAATTCAGCCATTTGTTCCTCAGGGTTTTCAGGGACAACAACCCCCACTGT gagwtjiml (1873) 

CCCATCCAGCCCTTCGTGCCCCAGGGCTTCCAGGGCCAGCAGCCCCCCCTGA gagoptjiml (1873) 

CCCAAGTGTTTCAGGGAATAAGCCAGTTACCACAATACAACAATTGTCCCCC gagwtjiml (1925) 
„ GCCAGGTGTTCCAGGGCATCAGCCAGCTGCCCCAGTACAACAACTGCCCCCC gagopt hml (1925) 

GCCACAAGCGGCAGTGCAGCAG TAG gagwt__hml (1977) 

CCCCCAGGCCGCCGTGCAGCAGGCTTAA gagoptjiml { 1977 ) 

Prt manipulation: 

65 Start with SEQ ID 63 (SEQ ID 71); manipulate to SEQ ID 64 (SEQ ID 72): 

ATGTGGGCAACCATTGTCGGGAAACGAGCAAAGGGGCCAGCCTCAGGCCCCA Pr ot wt Jiml ( 1 ) 
ATGTGGGCCACCATCGTGGGCAAGCGCGCCAAGGGCCCCGCCAGCGGCCCCA protopt Jiml { 1 ) 
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CAACAAACTGGGGCATTCCCAATTCAGCCATTTGTTCCTCAGGGTTTTCAGG Protwtjiml ( 53 ) 
CCACCAACTGGGGCATCCCCAACAGCGCCATCTGCAGCAGCGGCTTCAGCGG protopt_hml (53) 

GACAACAACCCCCACTGTCCCAAGTGTTTCAGGGAATAAGCCAGTTACCACA Pr otwt_hral (105) 
CACCACCACCCCCACCGTGCCCAGCGTGAGCGGCAACAAGCCCGTGACCACC prot opt Jhml (105) 

ATACAACAATTGTCCCCCGCCACAAGCGGCAGTGCAGCAGTAGATTTATGTA Prot wt_hml (157) 
ATCCAGCAGCTGAGCCCCGCCACCAGCGGCAGCGCCGCCGTGGACCTGTGCA protopt_hml (157) 

CTATACAAGCAGTCTCTCTGCTTCCAGGGGAGCCCCCACAAAAAACCCCCAC Protwt_hml (209) 
CCATCCAGGCCGTGAGCCTGCTGCCCGGCGAGCCCCCCCAGAAGACCCCCAC protop t_hml (209) 

AGGGGTATATGGACCCCTGCCTAAGGGGACTGTAGGACTAATCTTGGGACGA Protwt_hral (261) 
1 5 CGGCGTGTACGGCCCCCTGCCCAAGGGCACCGTGGGCCTGATCCTGGGCCGC protopt_hml (261) 

TCAAGTCTAAATCTAAAAGGAGTTCAAATTCATACTAGTGTGGTTGATTCAG Protwt_hral (313) 
AGCAGCCTGAACCTGAAGGGCGTGCAGATCCACACCAGCGTGGTGGACAGCG pr o t op t_hml (313) 

20 ACTATAAAGGCGAAATTCAATTGGTTATTAGCTCTTCAATTCCTTGGAGTGC Protwtjiml (365) 

ACTACAAGGGCGAGATCCAGCTGGTGATCAGCAGCAGCATCCCCTGGAGCGC prot optjiml (365) 

CAGTCCAAGAGACAGGATTGCTCAATTATTACTCCTGCCATACATTAAGGGT Protwt_hml (417) 
CAGCCCCCGCGACCGCATCGCCCAGCTGCTGCTGCTGCCCTACATCAAGGGC pr ot opt_hral (417) 

GGAAATAGTGAAATAAAAAGAATAGGAGGGCTTGGAAGCACTGATCCAACAG Protwt_hml (469) . 
GGCAACAGCGAGATCAAGCGCATCGGCGGCCTGGGCAGCACCGACCCCACCG protoptjiml (4 69) 

GAAAGGCTGCATATTGGGCAAGTCAGGTCTCAGAGAACAGACCTGTGTGTAA Protwtjiml ( 521 ) 
30 GCAAGGCCGCCTACTGGGCCAGCCAGGTGAGCGAGAACCGCCCCGTGTGCAA protopt_hml (521) 

GGCCATTATTCAAGGAAAACAGTTTGAAGGGTTGGTAGACACTGGAGCAGAT Protwtjiml ( 573 ) 
GGCCATCATCCAGGGCAAGCAGTTCGAGGGCCTGGTGGACACCGGCGCCGAC prot opt Jiml (573) 

35 GTCTCTATCATTGCTTTAAATCAGTGGCCAAAAAATTGGCCTAAACAAAAGG Protwtjiml (625) 

GTGAGCATCATCGCCCTGAACCAGTGGCCCAAGAACTGGCCCAAGCAGAAGG protopt_hml ( 625) 

CTGTTACAGGACTTGTCGGCATAGGCACAGCCTCAGAAGTGTATCAAAGTAC Protwtjiml ( 677 ) 
4 q CCGTGACCGGCCTGGTGGGCATCGGCACCGCCAGCGAGGTGTACCAGAGCAC protopt Jiml ( 677 ) 

GGAGATTTTACATTGCT TAGGGCCAGATAATCAAGAAAGTACTGTT CAGCCA Prot wt Jiml (729) 
CGAGATCCTGCACTGCCTGGGCCCCGACAACCAGGAGAGCACCGTGCAGCCC prot opt_hml (729) 

ATGATTACTTCAATTCCTCTTAATCTGTGGGGTCGAGATTTATTAC7VACAAT Protwtjiml (781) 
45 ATGATCACCAGCATCCCCCTGAACCTGTGGGGCCGCGACCTGCTGCAGCAGT protopt_hml (781) 

GGGGTGCGGAAATCACCATGCCCGCTCCATCATATAGCCCCACGAGTCAAAA Protwtjiml (833) 
GGGGCGCCGAGATCACCATGCCCGCCCCCAGCTACAGCCCCACCAGCCAGAA protopt_hml ( 833 ) 

50 . AATCATGACCAAGATGGGATATATACCAGGAAAGGGACTAGGGAAAAATGAA Protwtjiml (885) 

GATCATGACCAAGATGGGCTACATCCCCGGCAAGGGCCTGGGCAAGAACGAG protoptjiml (885) 

GATGGCATTAAAATTCCAGTTGAGGCTAAAATAAATCAAGAAAGAGAAGGAA Protwtjiml ( 937) 
GACGGCATCAAGATCCCCGTGGAGGCCAAGATCAACCAGGAGCGCGAGGGCA protopt_hml ( 937 ) 



TAGGGAATCCTTGC TAG Protwtjiml (989) 

TCGGCAACCCCTGCGCTTAA protopt_hml (989) 



Pol manipulation: 

60 Start with SEQ ID 65 (SEQ ID 73); manipulate to SEQ ID 66 (SEQ ID 74): 

ATGAATAAATCAAGAAAGAGAAGGAATAGGGAATCCTTGCTAGGGGCGGCCA polwt Jiml ( 1 ) 
ATGAACAAGAGCCGCAAGCGCCGCAACCGCGAGAGCCTGCTGGGCGCCGCCA polopt Jiml ( 1 ) 

CTGTAGAGCCTCCTAAACCCATACCATTAACTTGGAAAACAGAAAAACCAGT polwt Jiml ( 53 ) 
65 CCGTGGAGCCCCCCAAGCCCATCCCCCTGACCTGGAAGACCGAGAAGCCCGT polopt Jiml (53) 

GTGGGTAAATCAGTGGCCGCTACCAAAACAAAAACTGGAGGCTTTACATTTA polwt Jiml ( 105 ) 
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GTGGGTGAACCAGTGGCCCCTGCCCAAGCAGAAGCTGGAGGCCCTGCACCTG poloptjiml { 105 ) 

TTAGCAAATGAACAGTTAGAAAAGGGTCATATTGAGCCTTCGTTCTCACCTT polwt_hml ( 157 ) 
CTGGCCAACGAGCAGCTGGAGAAGGGCCACATCGAGCCCAGCTTCAGCCCCT polopt_hml (157) 

GGAATTCTCCTGTGTTTGTAATTCAGAAGAAATCAGGCAAATGGCGTATGTT polwt_hml (209) 
GGAACAGCCCCGTGTTCGTGATCCAGAAGAAGAGCGGCAAGTGGCGCATGCT polopt_hml (209) 

AACTGACTTAAGGGCTGTAAACGCCGTAATTCAACCCATGGGGCCTCTCCAA polwt_hml (261) 
10 GACCGACCTGCGCGCCGTGAACGCCGTGATCCAGCCCATGGGCCCCCTGCAG polopt_hml (261) 

CCCGGGTTGCCCTCTCCGGCCATGATCCCAAAAGATTGGCCTTTAATTATAA polwt_hml (313) 
CCCGGCCTGCCCAGCCCCGCCATGATCCCCAAGGACTGGCCCCTGATCATCA poloptjiml ( 313 ) 

15 TTGATCTAAAGGATTGCTTTTTTACCATCCCTCTGGCAGAGCAGGATTGCGA polwtjiml (365) 

TCGACCTGAAGGACTGCTTCTTCACCATCCCCCTGGCCGAGCAGGACTGCGA polopt_hml ( 365 ) 



20 
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AAAATTTGCCTTTACTATACCAGCCATAAATAATAAAGAACCAGCCACCAGG pol wt hml (417) 
GAAGTTCGCCTTCACCATCCCCGCCATCAACAACAAGGAGCCCGCCACCCGC polopt_hml (417) 

TTTCAGTGGAAAGTGTTACCTCAGGGAATGCTTAATAGTCGAACTATTTGTC polwt_hml (469) 
TTCCAGTGGAAGGTGCTGCCCCAGGGCATGCTGAACAGCCCCACCATCTGCC polopt_hml (469) 

AGACTTTTGTAGGTCGAGCTCTTCAACCAGTTAGAGAAAAGTTTTCAGACTG pol wt_hml (521) 
25 AGACCTTCGTGGGCCGCGCCCTGCAGCCCGTGCGCGAGAAGTTCAGCGACTG polopt_hml (521) 

TTATATTATTCATTGTATTGATGATATTTTATGTGCTGCAGAAACGAAAGAT polwt^hml (573) 
CTACATCATCCACTGCATCGACGACATCCTGTGCGCCGCCGAGACCAAGGAC polopt_hml ( 57 3 ) 

30 AAATTAATTGACTGTTATACATTTCTGCAAGCAGAGGTTGCCAATGCTGGAC polwtjiml (625) 

AAGCTGATCGACTGCTACACCTTCCTGCAGGCCGAGGTGGCCAACGCCGGCC polopt_hml ( 625 ) 

TGGCAATAGCATCTGATAAGATCCAAACCTCTACTCCTTTTCATTATTTAGG polwt_hml (677) 
TGGCCATCGCCAGCGACAAGATCCAGACCAGCACCCCCTTCCACTACCTGGG polopt_hml (677) 

GATGCAGATAGAAAATAGAAAAATTAAGCCACAAAAAATAGAAATAAGAAAA polwtjiml (729) 
CAT GCAGAT CGAGAACCGCAAGATCAAGCCCCAGAAGAT CGAGATCCGCAAG poloptjiml (729) 

GACACATTAAAAACACTAAATGATTTTCAAAAATTACTAGGAGATATTAATT polwtjiml (7 81) 
40 GACACCCTGAAGACCCTGAACGACTTCCAGAAGCTGCTGGGCGACATCAACT polopt_hml (781) 

GGATTCGGCCAACTCTAGGCATTCCTACTTATGCCATGTCAAATTTGTTCTC polwtjiml (833) 
GGATCCGCCCCACCCTGGGCATCCCCACCTACGCCATGAGCAACCTGTTCAG polopt Jiml (833) 

45 TATCTTAAGAGGAGACTCAGACTTAAATAGTAAAAGAATGTTAACCCCAGAG polwtjiml (885) 

CATCCTGCGCGGCGACAGCGACCTGAACAGCAAGCGCATGCTGACCCCCGAG polopt Jiml (885) 

GCAACAAAAGAAATTAAATTAGTGGAAGAAAAAATTCAGTCAGCGCAAATAA polwtjiml (937) 
GCCACCAAGGAGATCAAGCTGGTGGAGGAGAAGATCCAGAGCGCCCAGATCA polopt_hml ( 937 ) 

ATAGAATAGATCCCTTAGCCCCACTCCAACTTTTGATTTTTGCCACTGCACA polwtjiml (989) 
ACCGCATCGACCCCCTGGCCCCCCTGCAGCTGCTGATCTTCGCCACCGCCCA polopt Jiml (989) 

TTCTCCAACAGGCATCATTATTCAAAATACTGATCTTGTGGAGTGGTCATTC polwt_hml (1041) 
55 CAGCCCCACCGGCATCATCATCCAGAACACCGACCTGGTGGAGTGGAGCTTC polopt Jiml (1041 ) 

CTTCCTCACAGTACAGTTAAGACTTTTACATTGTACTTGGATCAAATAGCTA polwt_hml (1093) 
CTGCCCCACAGCACCGTGAAGACCTTCACCCTGTACCTGGACCAGATCGCCA ■ polopt Jiml (1093) 

60 CATTAATCGGTCAGACAAGATTACGAATAATAAAATT^TGTGGGAATGACCC polwtjiml (1145) 

CCCTGATCGGCCAGACCCGCCTGCGCATCATCAAGCTGTGCGGCAACGACCC polopt Jiml (1145) 

AGACAAAATAGTTGTCCCTTTAACCAAGGAACAAGTTAGACAAGCCTTTATC polwtjiml (1197). 
CGACAAGATCGTGGTGCCCCTGACCAAGGAGCAGGTGCGCCAGGCCTTCATC polopt Jiml (1197) 

AATTCTGGTGCATGGAAGATTGGTCTTGCTAATTTTGTGGGAATTATTGATA polwtjiml ( 124 9) 
AACAGCGGCGCCTGGAAGATCGGCCTGGCCAACTTCGTGGGCATCATCGACA polopt Juni (124 9) 

ATCATTACCCAAAAACAAAGATCTTCCAGTTCTTAAAATTGACTACTTGGAT polwtjiml (1301) 
70 ACCACTACCCCAAGACCAAGATCTTCCAGTTCCTGAAGCTGACCACCTGGAT polopt Jiml (1301) 
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TCTACCTAAAATTACCAGACGTGAACCTTTAGAAAATGCTCTAACAGTATTT polwt_hml (1353) 
CCTGCCCAAGATCACCCGCCGCGAGCCCCTGGAGAACGCCCTGACCGTGTTC poloptjiml (1353) 

ACTGATGGTTCCAGCAATGGAAAAGCAGCTT ACACAGGACCGAAAGAACGAG pol wt__hml (14 05) 
ACCGACGGCAGCAGCAACGGCAAGGCCGCCT ACACCGGCCCCAAGGAGCGCG polop t_hml (1405) 

TAATCAAAACTCCATATCAATCGGCTCAAAGAGCAGAGTTGGTTGCAGTCAT polwtjiml (1457) 
TGATCAAGACCCCCTACCAGAGCGCCCAGCGCGCCGAGCTGGTGGCCGTGAT polopt_hml (14 57) 

TACAGTGTTACAAGATTTTGACCAACCTATCAATATTATATCAGATTCTGCA polwt Jhml (1509) 
CACCGTGCT GCAGGACTTCG ACCAGCCCATCAAC AT CATCAGCGACAGCGCC polopt_hml ( 1509) 

TATGTAGTACAGGCTACAAGGGATGTTGAGACAGCTCTAATTAAATATAGCA polwt_hml (1561) 
15 TACGTGGTGCAGGCCACCCGCGACGTGGAGACCGCCCTGATCAAGTACAGCA polopt_hml (1561) 

TGGATGATCAGTTAAACCAGCTATTCAATTTATTACAACAAACTGTAAGAAA polwt_hml (1613) 
TGGACGACCAGCTGAACCAGCTGTTCAACCTGCTGCAGCAGACCGTGCGCAA polopt_hml (1613) ^ 

20 AAGAAATTTCCCATTTTATATTACACATATTCGAGCACACACTAATTTACCA polwt_hml (1665) 

GCGCAACTTCCCCTTCTACATCACCCACATCCGCGCCCACACCAACCTGCCC polopt_hml (1665) 

GGGCCTTTGACTAAAGCAAATGAACAAGCTGACTTACTGGT-ATCATCTGCA polwtjiml (1717) 
GGCCCCCTGACCAAGGCCAACGAGCAGGCCGACCTGCTGGTGAGCAGC-GCC polopt Jiml (1717) 

CTCATAAAAGCACAAGAACT TCATG CTT TGACTC ATGTAAATGCAG CAGGAT polwt_hml (17 68) 
CTGATCAAGGCCCAGGAGCTGCACGCCCTGACCCACGTGAACGCCGCCGGCC poloptjiml (1768 ) 

TAAAAAACAAATTTGATGTCACATGGAAACAGGCAAAAGATATTGTACAACA polwt_hml (1820) 
30 TGAAGAACAAGTTCGACGTGACCTGGAAGCAGGCCAAGGACATCGTGCAGCA polopt_hml (1820) 

TTGCACCCAGTGTCAAGTCTTACACCTGCCCACTCAAGAGGCAGGAGTTAAT polwt_hml ( 1872 ) 
CTGCACCCAGTGCCAGGTGCTGCACCTGCCCACCCAGGAGGCCGGCGTGAAC polopt__hml (1872) 

35 CCCAGAGGTCTGTGTCCTAATGCATTATGGCAAATGGATGTCACGCATGTAC polwt_hml (1924) 

CCCCGCGGCCTGTGCCCCAACGCCCTGTGGCAGATGGACGTGACCCACGTGC polopt_hml (1924) 
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CTT CATT TGGAAGAT T ATCATATGTTCACGTAACAGTTGAT ACT TATTCACA polwt_hml (197 6) 
CCAGCTTCGGCCGCCTGAGCTACGTGCACGTGACCGTGGACACCTACAGCCA polopt Jiml (1976) 

TTTCATATGGGCAACTTGCCAAACAGGAGAAAGTACTTCCCATGTTAAAAAA polwt_htnl ( 2028 ) 
CTTCATCTGGGCCACCTGCCAGACCGGCGAGAGCACCAGCCACGTGAAGAAG polopt Jtiml (2028 ) 

CATTTATTGTCTTGTTTTGCTGTAATGGGAGTTCCAGAAAAAATCAAAACTG polwt_hral (2080) 
45 CACCTGCTGAGCTGCTTCGCCGTGATGGGCGTGCCCGAGAAGATCAAGACCG polopt_hml (2080) 

ACAATGGACCAGGATATTGTAGTAAAGCTTTCCAAAAATTCTTAAGTCAGTG polwt_hml (2132) 
ACAACGGCCCCGGCTACTGCAGCAAGGCCTTCCAGAAGTTCCTGAGCCAGTG polopt_hml (2132) 

50 GAAAATTTCACATACAACAGGAATTCCTTATAATTCCCAAGGACAGGCCATA polwtjiml (2184) 

GAAGATCAGCCACACCACCGGCATCCCCTACAACAGCCAGGGCCAGGCCATC polopt_hml (2184) 

GTTGAAAGAACTAATAGAACACTCAAAACTCAATTAGTTAAACAAAAAGAAG polwt_hml (2236) 
GTGGAGCGCACCAACCGCACCCTGAAGACCC AGCTGGTGAAGCAGAAGGAGG pol opt_hml (2236) 

GGGGAGACAGTAAGGAGTGTACCACTCCTCAGATGCAACTTAATCTAGCACT po 1 wt_hml (2288) 
GCGGCGACAGCAAGGAGTGCACCACCCCCCAGATGCAGCTGAACCTGGCCCT polopt_hml (2288) 

CTATACTT T AAATTT T T TAAACAT T T AT AGAAATC AGACTACTACT TC TGC A polwt_hml (2340) 
60 GTACACCCTGAACTTCCTGAACATCTACCGCAACCAGACCACCACCAGCGCC polop t_hml (2340) 

GAACAACATCTTACTGGTAAAAAGAACAGCCCACATGAAGGAAAACTAATTT polwt_hml (2392) 
GAGCAGCACCTGACCGGCAAGAAGAACAGCCCCCACGAGGGCAAGCTGATCT polopt_hml (2392) 

65 GGTGGAAAGATAATAAAAATAAGACATGGGAAATAGGGAAGGTGATAACGTG pol wt_hml (2444) 

GGTGGAAGGACAACAAGAACAAGACCTGGGAGATCGGCAAGGTGATCACCTG polopt_hml (2444) 

GGGGAGAGGTTTTGCTTGTGTTTCACCAGGAGAAAATCAGCTTCCTGTTTGG polwt_hml (24 96) 
GGGCCGCGGCTTCGCCTGCGTGAGCCCCGGCGAGAACCAGCTGCCCGTGTGG polopt_hml (24 96) 
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ATACCCACT AGACAT TTGAAGT TCT ACAATGAACCCATCAGAGATGCAAAGA 
ATCCCCACCCGCCACCTGAAGTTCTACAACGAGCCCATCCGCGACGCCAAGA 

AAAGCACCTCCGCGGAGACGGAGACATCGCAATCGAGCACCGTTGACTCACA 
AGAGCACCAGCGCCGAGACCGAGACCAGCCAGAGCAGCACCGTGGACAGCCA 

AGATGAACAAAATGGTGACGTCAGAAGAACAGATGAAGTTGCCATCCACCAA 
GGACGAGCAGAACGGCGACGTGCGCCGCACCGACGAGGTGGCCATCCACCAG 

GAAGGCAGAGCCGCCAACTTGGGCACAACTAAAGAAGCTGACGCAGTTAGCT 
GAGGGCCGCGCCGCCAACCTGGGCACCACCAAGGAGGCCGACGCCGTGAGCT 

ACAAAATATCTAGAGAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGC 
ACAAGATCAGCCGCGAGCACAAGGGCGACACCAACCCCCGCGAGTACGCCGC 

TTGCAGCCTTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCAGG 
CTGCAGCCTGGACGACTGCATCAACGGCGGCAAGAGCCCCTACGCGTGCCGC 

AGC AGCT GCAGC TAA 

AGCAGCTGCAGCGCTTAA 

Env manipulation: 

Start with SEQ ID 81 (SEQ ID 83); manipulate to SEQ ID 82: 
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polwt_hml (2548) 
polopt_hral (2548) 

polwt_hml (2600) 
polopt_hml (2600) 

polwt_hml (2652) 
poloptjiml (2652) 

polwt_hml (2704) 
poloptjiml (2704) 

polwtjunl (2756) 
polopt_hml (2756) 

polwt_hml (2808) 
polopt_hml (2808) 

polwtjunl (2860) 
poloptjiml (2860) 



envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwtJiML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt — HML2 

envwt_HML2 
envoptJlML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envoptJlML2 

envwt_HML2 
envoptJHML2 

envwt_HML2 
envoptJiML2 

envwt_HML2 
envopt_HML2 

envwt__HML2 
envoptJiML2 



ATGAACCCAAGCGAGATGCAAAGAAAAGCACCTCCGCGGAGACGGAGACATCGCAATCGA 
ATGAACCCCAGCGAGATGCAGCGCAAGGCCCCCCCCCGCCGCCGCCGCCACCGCAACCGC 

GCACCGTTGACTCACAAGATGAACAAAATGGTGACGTCAGAAGAACAGATGAAGTTGCCA 
GCCCCCCTGACCCACAAGATGAACAAGATGGTGACCAGCGAGGAGCAGATGAAGCTGCCC 

TCCACCAAGAAGGCAGAGCCGCCAACTTGGGCACAACTAAAGAAGCTGACGCAGTTAGCT 
AGCACCAAGAAGGCCGAGCCCCCCACCTGGGCCCAGCTGAAGAAGCTGACCCAGCTGGCC 

ACAAAATATCTAGAGAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCC 
ACCAAGTACCTGGAGAACACCAAGGTGACCCAGACCCCCGAGAGCATGCTGCTGGCCGCC 

TTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCAGGAGCAGCTGCAGCTAAC 
CTGATGATCGTGAGCATGGTGGTGAGCCTGCCCATGCCCGCCGGCGCCGCCGCCGCCAAC 

TATACCTACTGGGCCTATGTGCCTTTCCCGCCCTTAATTCGGGCAGTCACATGGATGGAT 
TACACCTACTGGGCCTACQTGCCCTTCCCCCCCCTGATCCGCGCCGTGACCTGGATGGAC 

AATCCTACAGAAGTATATGT T AAT GATAGTGTATGGGT ACCTGGCCCCATAGATG ATCGC 
AACCCCACCGAGGTGTACGTGAACGACAGCGTGTGGGTGCCCGGCCCCATCGACGACCGC 

TGCCCTGCCAAACCTGAGGAAGAAGGGATGATGATAAATATTTCCATTGGGTATCATTAT 
TGCCCCGCCAAGCCCGAGGAGGAGGGCATGATGATCAACATCAGCATCGGCTACCACTAC 

CCTCCTATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAGTCCAAAATTGGTTG 
CCCCCCATCTGCCTGGGCCGCGCCCCCGGCTGCCTGATGCCCGCCGTGCAGAACTGGCTG 

GTAGAAGTACCTACTGTCAGTCCCATCTGTAGATTCACTTATCACATGGTAAGCGGGATG 
GTGGAGGTGCCCACCGTGAGCCCCATCTGCCGCTTCACCTACCACATGGTGAGCGGCATG 

TCACTCAGGCCACGGGTAAATTATTTACAAGACTTTTCTTATCAAAGATCATTAAAATTT 
AGCCTGCGCCCCCGCGTGAACTACCTGCAGGACTTCAGCTACCAGCGCAGCCTGAAGTTC 

AGACCTAAAGGGAAACCTTGCCCCAAGGAAATTCCCAAAGAATCAAAAAATACAGAAGTT 
CGCCCCAAGGGCAAGCCCTGCCCCAAGGAGATCCCCAAGGAGAGCAAGAACACCGAGGTG 

TTAGTTTGGGAAGAATGTGTGGCCAATAGTGCGGTGATATTACAAAACAATGAATTCGGA 
CTGGTGTGGGAGGAGTGCGTGGCCAACAGCGCCGTGATCCTGCAGAACAACGAGTTCGGC 
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envwt_HML2 
envopt__HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt__HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt^HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt__HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 

envwt_HML2 
envopt_HML2 



ACT ATTATAGAT TGG GCACCTCGAGGTC AATTCT ACCACAATT GCTCAGGACAAACTCAG 
ACCATCATCGACTGGGCCCCCCGCGGCCAGTTCTACCACAACTGCAGCGGCCAGACCCAG 

TCGTGTCCAAGTGCACAAGTGAGTCCAGCTGTTGATAGCGACTTAACAGAAAGTTTAGAC 
AGCTGCCCCAGCGCCCAGGTGAGCCCCGCCGTGGACAGCGACCTGACCGAGAGCCTGGAC 

AAACATAAGCATAAAAAATTGCAGTCTTTCTACCCTTGGGAATGGGGAGAAAAAGGAATC 
AAGCACAAGCACAAGAAGCTGCAGAGCTTCTACCCCTGGGAGTGGGGCGAGAAGGGCATC 

TCTACCCCAAGACCAAAAATAGTAAGTCCTGTTTCTGGTCCTGAACATCCAG7UVTTATGG 
AGCACCCCCCGCCCCAAGATCGTGAGCCCCGTGAGCGGCCCCGAGCACCCCGAGCTGTGG 

AGGCTTACTGTGGCTTCACACCACATTAGAATTTGGTCTGGAAATCAAACTTTAGAAACA 
CGCCTGACCGTGGCCAGCCACCACATCCGCATCTGGAGCGGCAACCAGACCCTGGAGACC 

AGAGATCGTAAGCCATTTTATACTATTGACCTGAATTCCAGTCTAACAGTTCCTTTACAA 
CGCGACCGCAAGCCCTTCTACACCATCGACCTGAACAGCAGCCTGACCGTGCCCCTGCAG 

AGTTGCGTAAAGCCCCCTTATATGCTAGTTGTAGGAAATATAGTTATTAAACCAGACTCC 
• AGCTGCGTGAAGCCCCCCTACATGCTGGTGGTGGGCAACATCGTGATCAAGCCCGACAGC 

CAGACTATAACCTGTGAAAATTGTAGATTGCTTACTTGCATTGATTCAACTTTTAATTGG 
CAGACCATCACCTGCGAGAACTGCCGCCTGCTGACCTGCATCGACAGCACCTTCAACTGG 

CAACACCGTATTCTGCTGGTGAGAGCAAGAGAGGGCGTGTGGATCCCTGTGTCCATGGAC 
CAGCACCGCATCCTGCTGGTGCGCGCCCGCGAGGGCGTGTGGATCCCCGTGAGCATGGAC 

CGACCGTGGGAGGCCTCGCCATCCGTCCATATTTTGACTGAAGTATTAAAAGGTGTTTTA 
CGCCCCTGGGAGGCCAGCCCCAGCGTGCACATCCTGACCGAGGTGCTGAAGGGCGTGCTG 

AATAGATCCAAAAGATTCATTTTTACTTTAATTGCAGTGATTATGGGATTAATTGCAGTC 
AACCGCAGCAAGCGCTTCATCTTCACCCTGATCGCCGTGATCATGGGCCTGATCGCCGTG 

ACAGCTACGGCTGCTGTAGCAGGAGTTGCATTGCACTCTTCTGTTCAGTCAGTAAACTTT 
ACCGCCACCGCCGCCGTGGCCGGCGTGGCCCTGCACAGCAGCGTGCAGAGCGTGAACTTC 

GTTAATGATTGGCAAAAAAATTCTACAAGATTGTGGAATTCACAATCTAGTATTGATCAA 
GTGAACGACTGGCAGAAGAACAGCACCCGCCTGTGGAACAGCCAGAGCAGCATCGACCAG 

AAATTGGCAAAT CAAATT AAT GAT CT T AG AC AAACTGT CAT TTGGATGGGAGACAGACTC 
AAGCTGGCCAACCAGATCAACGACCTGCGCCAGACCGTGATCTGGATGGGCGACCGCCTG 

ATGAGCTTAGAACATCGTTTCCAGTTACAATGTGACTGGAATACGTCAGATTTTTGTATT 
ATGAGCCTGGAGCACCGCTTCCAGCTGCAGTGCGACTGGAACACCAGCGACTTCTGCATC 

ACACCCCAAATTTATAATGAGTCTGAGCATCACTGGGACATGGTTAGACGCCATCTACAG 
ACCCCCCAGATCTACAACGAGAGCGAGCACCACTGGGACATGGTGCGCCGCCACCTGCAG 

GGAAGAGAAGATAATCTCACTTTAGACATTTCCAAATTAAAAGAACAAATTTTCGAAGCA 
GGCCGCGAGGACAACCTGACCCTGGACATCAGCAAGCTGAAGGAGCAGATCTTCGAGGCC 

TCAAAAGCCCATTTAAATTTGGTGCCAGGAACTGAGGCAATTGCAGGAGTTGCTGATGGC 
AGCAAGGCCCACCTGAACCTGGTGCCCGGCACCGAGGCCATCGCCGGCGTGGCCGACGGC 

CTCGCAAATCTTAACCCTGTCACTTGGGTTAAGACCATTGGAAGTACTACGATTATAAAT 
CTGGCCAACCTGAACCCCGTGACCTGGGTGAAGACCATCGGCAGCACCACCATCATCAAC 

CTCATATTAATCCTTGTGTGCCTGTTTTGTCTGTTGTTAGTCTGCAGGTGTACCCAACAG 
CTGATCCTGATCCTGGTGTGCCTGTTCTGCCTGCTGCTGGTGTGCCGCTGCACCCAGCAG 
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envwt HML2 CTCCGAAGAGACAGCGACCATCGAGAACGGGCCATGATGACGATGGCGGTTTTGTCGAAA 
envopt_HML2 CTGCGCCGCGACAGCGACCACCGCGAGCGCGCCATGATGACCATGGCCGTGCTGAGCAAG 

envwt HML2 AGAAAAGGGGGAAATGTGGGGAAAAGCAAGAGAGATCAGATTGTTACTGTGTCTGTGGCCTAA 
5 ' envopt_HML2 CGCAAGGGCGGCAACGTGGGCAAGAGCAAGCGCGACCAGATCGTGACCGTGAGCGTGGCCTAA 

IN VITRO EXPRESSION OF GA G SEQUENCES 

Three different gag-encoding sequences were cloned into the pCMVKm2 vector: 
(1) gag opt HML-2 (SEQ ED 54, including SEQ ID 62 and encoding SEQ ID 70 - Fig. 5). 
1 0 (2) gag opt PCAV (SEQ ID 80, including SEQ ID 77 and encoding SEQ ID 79 - Fig. 8). 
(3) gag wt PCAV (SEQ ID 53, including SEQ ID 76 and encoding SEQ ID 78 - Fig. 4). 

The vectors were used to transfect 293 cells in duplicate in 6-well plates, using the 
polyamine reagent Transit™ LT-1 (PanVera Corp, Madison WI) plus 2 ng DNA. 

Cells were lysed after 48 hours and analyzed by western blot using pooled mouse antibody 
15 against HML2-gag as the primary antibody (1:400), and goat anti-mouse HRP as the secondary 
antibody (1:20000). Figure 10 shows that 'gag opt PCAV (lane 2) expressed much more 
efficiently than 'gag wt PCAV (lane 3). Lane 1 (*gag opt HML-2') is more strongly stained than 
lane 2 ('gag opt PCAV*), but this could be due to the fact that the primary antibody was raised 
against the homologous HML-2 protein, rather than reflecting a difference in expression 
20 efficiency. To address this question, antibodies were also raised against the PCAV product and 
were used for Western blotting. Figure 1 1 A shows results using the anti-HML2 as the primary 
antibody (1:500), and Figure 11B shows the results with anti-PCAV (1:500). Each antibody 
stains the homologous protein more strongly than the heterologous protein. 

NUCLEIC ACW IMMUNIZATION 
25 Vectors of the invention are purified from bacteria and used to immunize mice. 

T CELL RESPONSES TO PC A V GAG 

CB6F1 mice were intramuscularly immunized with pCMVKm2 vectors encoding PCAV 
gag (Figures 4 & 8) and induction of gag-specific CD4+ and CD8+ cells were measured. 

Mice received four injections of 50f*g plasmid at week 0, 2, 4 and 6. These plasmids 
30 included the wild type gag sequence (SEQ ID 76). Mice were then split into two separate groups 
for further work. 

The first group of three mice received a further 50p,g of plasmid at 25 weeks, but this 
plasmid included the optimized gag sequence (SEQ ID 77). Eleven days later spleens were 
harvested and pooled and a single cell suspension was prepared for culture. Spleen cells (1 x 10 6 
35 per culture) were cultured overnight at 37°C in the absence ("unstimulated") or presence 
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("stimulated") of 1 x 10 7 plaque-forming units (pfa) of a recombinant vaccinia which contains 
the PCAV gag sequence ("rW-gag", produced by homologous recombination of cloning vector 
pSCl 1 [1 ] 6], followed by plaque purification of recombinant rWgag). Duplicate stimulated and 
unstimulated cultures were prepared. The following day Brefeldin A was added to block 
cytokine secretion and cultures were continued for 2 hours. Cultures were then harvested and 
stained with fluorescently-Iabeled monoclonal antibodies for cell surface CD8 and intracellular 
gamma interferon (IFN-?). Stained samples were analyzed by flow cytometry and the fraction of 
CD8+ cells that stained positively for intracellular IFN-? was determined. Results were as 
follows: 



Culture condition 



Unstimulated 



Stimulated 



Culture #1 



0.10 



1.51 



Culture #2 



0.14 



1.27 



Average 



0.12 



1.39 



1.27 



Difference 

An average of 1.27% of the pooled splenic CD8+ cells synthesized IFN-? in response to 
stimulation with rW-gag. This demonstrates that the DNA immunization induced CD8+ T ceils 
that specifically recognized and responded to PCAV gag. 

The second group of four mice received a further 50p.g of plasmid at 28 weeks, but this 
plasmid included the optimized gag sequence (SEQ ID 77). Twelve days later spleens were 
harvested. As a specificity control, a spleen was also obtained from a CB6F1 mouse that had 
been vaccinated with a pCMV-KM2 vector encoding HML2 env. 

Single cell suspensions from individual spleens were prepared for culture. Spleen cells 
(1 x 10 6 per culture) were cultured overnight at 37°C in the absence of stimulation or in the 
presence of 1 x 10 7 pfu rW-gag. As a specificity control, additional cultures contained another 
recombinant vaccinia virus, rW-HIVgpl60env.SF162 ('WV-HIVenv" - contains full-length 
env gene from SF162 isolate of HrV-1), which was not expected to cross-react with either gag or 
env from PCAV. 

Duplicate cultures were prepared for each condition. The following day Brefeldin A was 
added to block cytokine secretion and anti-CD28 antibody was added to co-stimulate CD4 T 
cells. Cultures were continued for 2 hours and then harvested and stained with fluorescently- 
labeled monoclonal antibodies for cell surface CD8 and CD4 and intracellular IFN-?. Stained 
samples were analyzed by flow cytometry and the fractions of CD8+CD4- and CD4+8- T cells 
that stained positively for intracellular EFN-? were determined. Results are shown in the 
following table, expressed as the % of stained cells in response to stimulation by either PCAV 
gag or HIV env during spleen culture, after subtraction of the average value seen with cells 
which were not stimulated during spleen culture: 
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Spleen culture 
stimulation 


Vector administered at 28 weeks 


PCAV gag 


PCAV gag PCAV gag 


PCAV gag 


PCAV env 


CD8 


PCAV gag 


1.32 


1.88 


3.00 


2.09 


0.13 


HIVenv 


0.04 


0.12 


-0.02 


0.23 


0.05 


CD4 


PCAV gag 


0.26 


0.17 ' 


0.40 


0.22 


-0.01 


HIVenv 


0.01 


-0.02 


-0.03 


0.01 


-0.02 



For the 4 mice that had been vaccinated with a vector encoding PCAV gag, therefore, the 
rW-gag vector stimulated 1.32% to 3.00% of CD8+ T cells to produce IFN-?. However, there 
were few CD8+ T cells (<0.23%) that responded to the irrelevant rW-HIVgpl60env vector. 
The CD8+ T cell response is thus specific to PCAV gag. Furthermore, the control mouse that 
5 was immunized with PCAV env had very few CD8+ T cells (0.13%) which responded to the 
vaccinia stimulation. 

Similarly, vaccination with PCAV gag, but not with PCAV env, induced CD4+ T cells 
specific for PCAV gag (0. 1 7% to 0.40%). 

DNA immunization with vectors encoding PCAV gag thus induces CD8+ and CD4+ T 
10 cells that specifically recognize and respond to the PCAV gag antigen. 

VIRUS-LIKE PARTICLES 

293 cells were fixed 48 hours after transient transfection with pCMV-gag, either from 
HML-2 or from PCAV, and inspected by electron microscopy (Figure 12). VLPs were produced 
in both cases, but these were mainly intracellular for PCAV and mainly secreted for HML-2. 

15 The assembly of viable VLPs from PCAV and HML-2 indicates that the gag protein has 

retained its essential activity even though the endogenous virus is "dormant" and might thus be 
expected to be subject to mutational inactivation. 

The above description of preferred embodiments of the invention has been presented by 
20 way of illustration and example for purposes of clarity and understanding. It is not intended to be 
exhaustive or to limit the invention to the precise forms disclosed. It will be readily apparent to 
those of ordinary skill in the art in light of the teachings of this invention that many changes and 
modifications may be made thereto without departing from the spirit of the invention. It is 
intended that the scope of the invention be defined by the appended claims and their equivalents. 
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SEQUENCE LISTING INDEX 



SEQ ID 


DESCRIPTION 


1-9 


Gag sequences 


10-14 


Prt sequences 


15-21 


Pol sequences 


22-28 


Env sequences 


29-31 


cORP sequences 


32-37 


PCAP sequences 


38-50 


Splice variants A-M sequences 


51 


pCMVKm2.cORPopt HML-2 (Figure 2) 


52 


pCMVKm2.pCAP5opt HML-2 (Figure 3) 


53 


pCMVKm2.gag wt PCAV (Figure 4) 


54 


pCMVKm2.gagopt HML-2 (Figure 5) 


55 


pCMVKm2.Protopt HML-2 (Figure 6) 


56 


pCMVKrn2.Polopt HML-2 (Figure 7) 


57-66 


Nucleotide sequences pre- and post-manipulation 


67 


Manipulated cORF 


68 


Manipulated PCAP5 


69&70 


Gag — pre- and post-manipulation 


71 &72 


Prt - — pre- and post-manipulation 


73&74 


Pol — pre- and post-manipulation 


75 


PCAV, from the beginning of its first 5' LTR to the end of its fragmented 3 1 LTR 


76 & 77 


PCAV Gag nucleotide sequences — pre-and post manipulation 


78 & 79 


PCAV Gag amino acid sequences — pre-and post manipulation 


80 


pCMVKm2.gagopt PCAV (Figure 8) 


81 


Wild-type env from HML-2 


82 


Optimized env from HML-2 


83 


Amino acid sequence encoded by SEQ IDs 81 & 82 



NB: < 

- SEQ IDs 1 to 9 disclosed in reference 1 as SEQ IDs 85, 91 , 97, 102, 92, 98, 103, 104 & 146 
5 - SEQ IDs 10 to 14 disclosed in reference 1 as SEQ IDs 86, 99, 105, 106 & 147 

- SEQ IDs 15 to 21 disclosed in reference 1 as SEQ IDs 87, 93, 100, 107, 94, 108 & 148 

- SEQ IDs 22 to 28 disclosed in reference 1 as SEQ IDs 88, 95, 101, 107, 96, 108 & 149 

- SEQ IDs 29 to 31 disclosed in reference 1 as SEQ IDs 89, 90 & 109 

- SEQ IDs 32 to 37 disclosed in reference I as SEQ IDs 10, 1 1, 12, 7, 8 & 9 
10 - SEQ IDs 38 to 50 disclosed in reference 1 as SEQ IDs 28-37, 39, 4 1 & 43 

- SEQ ID 75 disclosed in reference 3 as SEQ ID 1. 
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ABSTRACT 

A nucleic acid vector comprising: (i) a promoter; (ii) a sequence encoding a HML-2 
polypeptide operably linked to said promoter; and (Hi) a selectable marker. Preferred vectors 

5 comprise: (i) a eukaryotic promoter; (ii) a sequence encoding a HML-2 polypeptide downstream 
of and operably linked to said promoter; (iii) a prokaryotic selectable marker; (iv) a prokaryotic 
origin of replication; and (v) a eukaryotic transcription terminator downstream of and operably 
linked to said sequence encoding a HML-2 polypeptide. Vectors of the invention are particularly 
useful for expression of HML-2 polypeptides either in vitro (e.g. for later purification).or in vivo 

10 (e.g. for nucleic acid immunization). They are well suited to nucleic acid immunization against 
prostate tumors. A preferred HML-2 is PCAV, which is located in chromosome 22 at 20.428 
megabases (22ql 1 .2). 
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FIGURE 5 
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FIGURE 6 
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FIGURE 8 
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SEQUENCE L1STDMG 

SEQ ID H 

ATGGGGCAAACTAAMGTAAAATTAAAAGTAAAmTGCCTC^ 
AAAATCTmCMGCTATTTCAMTMTAGMCM^ 

GGAACTAAAACAAGCAGGTAGGAAGGGTAATATCATTCCACTTACAGTATGGAATGATTGGGCCATTATTAAAGCAGCTTTAGAACCATTTCAAACA 
GMGMGATAGCGTTTCAGTTTCTGATGCCCCTGGAAGCTGTATAATAGATTGTMTGAAAACACMGGAAAAAATCCCAGAAAGAMCGGMGGTT 
TACATTGCGAATATGTAGCAGAGCCGGTAATGGCTC^GTCMCGCAAAATGTTGACTATAATCAATTACAGGAGGTGATATATCCTGAAACGTTAAA 
ATTAGMGGAAMGGTCCAGAATTAGTGGGGCCATCAGAGTCTAMCCACGAGGCACMGTCCTCTTCCAGCAGGTCAGGTGCCTGTMCATTACM 
CCTCAAAA^AGGTTAMGAAMTMGAC^^ 

AGTATGGATATCCAGGAATGCCCCCAGCACCACAGGGCAGGGCGCCATACCCTCAGCCGCCCACTAGGAGACTTAATCCTACGGCACCACCTAGTAG 
A(^GGGTAGTAMTTACATGAAATTATTGATAMTCAAGAMGGAAGGA 

GGAGAAGGAGCCCMGAGGGAGAGCCTCCCACAGTTGAGGCCAGATACAAGTCTTTTTCGATAAAAAAGCTAA^ 

AGTATGGACCCMCTCCCCTTATATGAGGACATTATTAGATTCCATTGCTCATGGACATAGACTQVTTCCTTATGATTGGGAGATTCTGGCAAAATC 

GTCTCTCTCACCCTCTCAATTTTTACAATTTMGACTT 

AACATAGATGCAGATCMCTATTAGGMTAGGTCAAMT^ 

CTATCTGCCTTAGAGCCTGGGAAAAAATCCMGACCCAGGAAGTACCTGCCCCTCATTTAATACAGTAAGACAAGGTTCAAAAGAGCCCTATCCTGA 

TTTTGTGGCMGGCTCCMGATGTTGCTCAAMGTCMTTGCTGA^ 

CCTGAGTGTCAATCAGCCATTMGCCATTAAMGGAMGGTTCCT^ 

GAGCTATGCATAAAGCTATGCTTATGGCTC^^ 

TCAMTTGGTCACTTAAAAMGMTTGCCCAGTCT^ 

CCMGATGTAAAAMGGAAMCATTGGGCTAGTCAA^ 

CTCAGGCCCCACMCAAACTGGGGCATTCCCMTTCAGCCATT^ 

AATAAGCCAGTTACCACAATACAACAATTGTCCCCCGCCACAAGCGGCAGTGCAGCAG 

SEQ ID 2 

ATGGGGCAAACTAAAAGTAAAATTAAAAGTAAATATGCCTCTTATCTCAGCTTTATTAAAATTCTTTTAAAAAGAGGGGGAGTTAAAGTATCTACAA 
AAAATCTMTCMGCTATTTCAMTAATAGAACMTTTTGCCCATGG 

GGAACTAAMCMGCAGGTAGGMGGGTAATATCATTCCACTTACAGTATGGAATGATTGGGCCATTATTAM 
GMGMGATAGCGTTTCAGTTTCTGATGCCOT 

TACATTGCGAATATGTAGCAGAGCCGGTAATGGCTCAGTCAACGCAAAATGTTGACTATAATCAATTACAGGAGGTGATATATCCTGAAACGTTAAA 

ATTAGAAGGAAAAGGTCCAGAATTAGTGGGGCCATCAGAGTCTAAACCACGAGGCACAAGTCCTCTTCCAGCAGGTCACGTGCCTGTAACATTACAA 

CCTCAAAAGCAGGTTAAAGAAMTMGACCCMCCGCCAGTAGCCTATCMTACTGGCCTCCGGCTGAACTTCAGTATCGGCCA 

AGTATGGATATCCAGGAATGCCCCCAGCACCACAGGGCAGGGCGCCATACCCTCAGCCGCCCACTAGGAGACTTAATCCTACGGCACCACCTAGTAG 

ACAGGGTAGTAAATTACATGAMTTATTGATAMTCMGAMGGAAGGAGATACTGAGGC^TGGCAATTCCCAGTMCGTTAGAACCGATGCCACCT 

GGAGMGGAGCCCMGAGGGAGAGCCTCCCACAGTTGAGGCCAGATACAAGTCTTTTTCGATAAAAAA 

agtatggacccaactccccttatatgaggacattattagattccattgctcatggacatagactcattccttatgattgggagattctggcaaaatc 
gtctctctcaccctctcaatttttacmtttmgacttggtggattgatggggtacaagaacaggtccgaagaaatagggctgccaatcctccagtt 
mcatagatgcagatcmctattaggmtaggtca^ 

ctatctgccttagagcctgggaaaamtccmgacccaggaagtacctgcccctcatttaatacagtaagacaaggttcaaaagagccctatcctga 

ttttgtggcaaggctccaagatgttgctcaamgtcmttgctgatga 

cctgagtgtcmtcagccattmgcc^ttaaaaggaaaggttcctg^ 

gagctatgtatamgctatgcttatggctcmgcmtaacaggagttgttttaggaggacmgttagaacatttggaagaaaatgttataattgtgg 

tcaaattggtcacttaaaaaagaattgcccagtcttaaataaacagaatataactattcaagcaactacaacaggtagagagccacctgacttatgt 

ccmgatgtaaaaaaggaamcattgggctagtcmtgtcgttctamtttgataaamtgggcmc 

ctcaggccccacmcamctggggcattcccmttcagcc^^ 

'aataagccagttaccacaatacaacaattgtcccccgccacaagcggcagtgcagcagtag 

SEQ ID 3 

atggggcamctaaaagtaaaactaaaagtamtatgot 

aaaatctmtcaagctatttcaaatmtagaacaatttt 

<&aactaaaacaagcaggtagaaagggtaatat^ 

aaagmgatagcgtttcagtttctgatgcccctggaagctgtgtmtagattgtmtgaaaagacagggagaamtcccag^ 

tacattgccmtatgtmcagagccagtaatggct^ 

attagaaggaamggtccagmttagtggggccatcagagtc^ 

cctcaaacgcaggttamgaamtaagacccmccg^ 

agtatggatatccaggaatgcccccagc^ctacagggcagggcgccatatcctcagccgcccactgtgagacttaatc 
acmggtggtacactgcacgcagtcattgatgmgccagaamcagggagatcttgaggcatggcggttcctggtaatttt 
ggggaagagactcmctaggagcgcctgcccgagctgagactagatgtgmcctttcaccatgaaaatgtta 
aatatggatccaactccccttatataagaacattattagattccattgctcatggaaatagacttactccttatgactgggaaagtttggccaaatc 
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TTCCCTTTCATCCTCTCAGTATCTACAGTTTAAAACCTGGTGGATTGATGGAGTACAAGAACAGGTACGMAAAATCAGGCTACTAAGCCCACTGTT 
AATATAGACGCAGACCAATTGTTAGGAACAGGTCCAAATTGGAGCACCATTAACCAACAATCAGTGATGCAGAATGAGGCTATTGAACAAGTAAGGG 
CTATTTGCCTCAGGGCCTGGGGAAAAATT(^GGACCCAGGAACAGCTTTCCCTATTMTTCAATTAGACMGGCTCT 

TGTGGCMGATTACMGATGCTGCTCAAAAGTCTATTACAGATGACMTGCCCGAAAAGTTATTGTAGAATTAATGGCCTATGAAAATGCAAATCCA 

GMTGTCAGTCGGCCATAAAGCCATTAAMGGAAAAGTTCCAGCAGGAGTTGATGTMTTACAGMTATGTGMGGCTTGTGATGGGATTGGAGGAG 

CTATGCATAAGG(^TGCTMTGGCTCMGCAATGAGGGGGCTCACTCT^^ 

AATCGGTCATCTGAAAAGGAGTTGCCCAGTCTTAAAm 

CCAAMTGTGGAAAAGGAAMCATTGGGCCMTCMTGTCATTCTAMTTTGATAMGATGGGCMCCATTGTCGGGAMCAGGAAGAGGGGCCAGC 

CTCAGGCCCCCC^CAMCTGGGGCATTCCCAGTTCMCTGTTTGT^ 

GGGAGTCAGCCAATTACAACAATCCAACAGCTGTCCCGCGCCACAGCAGGCAGCGCCACAGTAG 

SEQ ID 4 

ATGGGGCAMCTAAAAGTAAMTTAAMGTAAATATGCCTCTTATCTCAGC 

AAMTCTAATCAAGCTATTTCAAATAATAGMCMTXTTGCCCATGGTTTCCAGAACAAGGAACTTCAGATCTAAAAGATTGGAAAAGAATTGGTAA 
GGAACTAAMCAAGCAGGTAGGMGGGTMTATCATTCCACTTACAGTATG 

GAAGAAGATAGCATTTCAGTTTCTGATGCCCCTGGMGCTGTTTMTAGATTGTAATGAAMCACMGGAAAAAATCCCAGAAAGAAA 
TACATTGCGMTATGTAC^AGAGCCGGTMTGGOT 

ATTAGAAGGAAAAGGTCCAGMTTMTGGGGCC^TCAGAGTCTAMCCACGAGGCACMGTCCTCTTCCAGCAGGTCAGGTGCTC 
CCTCAAMGCAGGTTAMGAAAATMGACCCAACCGCAAG 

GTATGGATATCCAGGMTGCCCCCAGCACCACAGGGCAGGGCGCCATACCATCAGCCGCCCACTAGGAGACTTMTCCTATGGCACCACCTAGTAG^ 

CAGGGTAGTGAATTAC^TGJWVTTATTGATAAATCAAGAAAGGAAGGAGATACTGAGGCATGGCAATTCCCAGTAA 

SEQ ID 5 

MGQTKSKIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQIIEQFCPW 
EEDSVSVSDAPGSCIIDCNENTRKKSQKETEGLHCEYVAEPVMAQSTQNVDYN^ 

PQKQVKENKTQPPVAYQYWPPAELQYRPPPESQYGYPGMPPAPQGRAPYPQPPTRRLNPTAPPSRQGSKLHEIIDKSRKEGDTEAWQFPVTLEPMPP 
GEGAQEGEPPWEARYKSFSIKKLKDMKEGVKQYGPNSPYMRTLLDSIMGHRLIPYDWE 

NI DADQLLGIGQNWSTISQQALMQNEAIEQVRAICLRAWEKIQDPGSTCPS FNTVRQGSKEPYPDFVARLQDVAQKS IADEKARKVIVELMAYENAN 
PECQSAIKPLKGKVPAGSDVISEYVKACDGIGGAMYKAMIMAQAITGVVLGGQVRtFGRKCYNCGQIGHLKKNCPV^ 
PRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQTGAFPIQPFVPQGFQGQQPPLSQVFQGISQLPQYNNCPPPQAAVQQ 
SEQ ID 6 

MGQTKSKTKSKYASYLSFIKILLKRGGVRVSTKNLIKL 

KEDSVSVSDAPGSCVIDCNEKTGRKSQKETESLHCEYVTEPVMAQSTQNVDYNQI,QGVIYPETLKLEGKGPELVGPSESKPRGPSPLPAGQVPVTLQ 

PQTQVKENKTQPPVAYQYWPPAELQYLPPPESQYGYPGMPPALQGRAPYPQPPTVRLN^ 

GEETQVGAPARAETRCEPFTMKMLKDIKEGVKQYGSNSPYIRTLLDSIMGNRLTPY^ 

NIDADQLLGTGPtWSTINQQSVMQNEAIEQVRAICLRAWGKIQDPGTAFPINSIRQGSKEPYPDFVARLQDMQKSITDDNARKVIV 
ECQSAIKPLKGKVPAGVDVITEYVKACDGIGGAMHKAMLMAQAM^^ 

PKCGKGKHWANQCHSKFDKDGQPLSGNRKRGQPQAPQQTGAFPVQLFVPQGFQGQQPLQKIPPLQGVSQLQQSNSCPAPQQAAPQ 
SEQ ID 7 

MGQTKSKIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQnEQ 

EEDS I S VSDAPGSCL IDCNENTRKKSQRETESLHCEYVAEPVMAQSTQWDYNQ 

PQKQVKEMTQPQVAYQYCRWLNFSIGHPQKVSMDIQECPQM 

SEQ ID 8 

MPPAPQGRAPYHQPPTRRLNPMAPPSRQGSELHEIIDKSRKEGDTEAWQFPVTLEP 
PYMRTLLDSIAYGHRLIPYDWEILAKSSLSPSQFLQFKTWWIDG^ 

WEKIQDPGSTCPSFNWRQGSKEPYPDFVARLQDVAQKSIADEKAGKVIVELMYENMPECQSAIKPLKGKVPAGSDVISEYVKACDGIGGAMHKA 
MimQAITGWLGGQVRTFGGKCYNCGQIGHLKKNCPVLNKQNITIQATTTGREPPDLCPRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQ 
TQAFPIQPFVPQGFQGQQPPLSQVFQGISQLPQYNNCPSPQAAVQQ 

SEQ ID 9 

MGQTKSRIKSKYASYLSFIKILLKRGGVKVSTKNLIKLFQIIE 

EEDSVSVSDAPGSCIIDCNENTGKKSQKETEGLHCEYVAEPVMAQSTQNVDYNQLQEVIYPETLKLEGKGPELVGPSESKPRGTSPLPAGQVPVTLQ 

PQKQVKENKTQPPVAYQYWPPAELQYRPPPESQYGYPGMPPAPQGRAPYPQPPTRRLNPTAPPSRQGSKLHEIIDKSRKEGDTEAWQFPVTLEPMPP 

GEGAQEGEPPTVEARYKSFSIKKLKDMKEGVKQYGPNSPYMRTLLDSIAHGHRLIPYDWEIQAKSSLSPSQFLQFKTWWIDGVQEQVRRNRAANP 

NIDADQLLGIGQNWSTISQQALMQNEAIEQVRAICLRAWEKIQDPGSTCP 

PECQSAIKPLKGKVPAGSDVISEYVKACDGIGGAMHKAMLMAQAIT^ 

PRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQTGAFPIQPFVPQGFQGOQPPI»SQVFQGISQIiPQYNNCPPPQAAVQQ 
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SEQ ID 10 

ATGGGCAACCATTGTCGGGAAACGAGCAMGGGGCCAGCCTCAGGCCCCA^ 
TraOBGACMCMCCCCCACTGTCCCAAGTG^ 

TAGATXTATGTACTATACAAGCAGTCTCTCTGCTTCCAGGGGAGCCCCCACAAAAAACCCCCACAGGGGTATATGGACCCCTGCCTAAGGGGACTGT 

AGGACTAATCTTGGGACGATCMGTCTAMTCTAAAAGGAGTTCAAATTCATACTAGTGTCKSTTGATO 

ATTAOTCTTCMTTCCTTC&A^^ 

GAATAGGAGGK3CTTGGAAGCACTGATCCAACAGGAAAG(^TGCAT^^ 

AGGAAMCAGTTTGMGGGTTGGTAGACACTGGAGCAGATGTCTCTATC 

ACAG^CTTGTCGGCATAGTCACAGOT^ 

CMTGATTACTTCMTTCCTCTTMTCTGTGGGGTCGAGATTTATTACAACAATGGGGTGCGGAAATCACCATGCCCGCTCCATCATATAGCCCCAC 

C^GTCAAAAMTCATGACCAAGATGGGATATATACCAGGAMGGGACTAGGGAAAMTGAA^ 

CAAGAAAGAGAAGGAATAGGGAATCCTTGC 

SEQ HD 11 

ATGGGCTUICCATTGTCGGGAAACTIGGAAGAGGGGCC^GCCTCAGGC^ 

TCMGGACAACMCCCCTACAGAAMTACCACCACTTCAGGGAGTCA^^ 

CAGTAGATTTATGTTCCACCCAMTGGTCTCTTTACTCCCT^^ 

GGTAGGCCTTATTTTAGGGAGATCAAGTCTAAATTTGMGGGAGTCCAAATTCATACTGGGGTMTTTATTCAGATTATAAAGGGGGAATTCAGTTA 
GTGATCAGCTCCACTGTTCCCTGGAGTGCCAATCCAGGTGAra 

AAAGAACAGGAGGGTTTGGAAGTACCAACCCTGCAGGAAAAGCCACTTATTGGGCTAATCAGQTCTCAGAGGATAGACCCGTGTGTACAGTCACTAT 
TCAGGGAAAGAGTTTGMGGATTAGTGGATACCCAGGCTGATGTTTCTATCATCGGCATAGGCACCGCCTCAGAAGTGTATCAAAGTGCCATGATTT 
TACATTGTCTAGGATCTGATAATCMGAAAGTACGGTTCAGCCTATGATCACTTCTATTCCMTCMTTTAT^GGCCGAGACTTGTTACMCMTG 
GCATGCAGAGATTACTATCCCAGCCTCCCTATAra^^ 

MTCMGATGGCATTAAAGTCCCAACTGAGGCTGAAAAAMTCAAAAAMGAAAAGGAATAGGG 
CAAAACCCATTCCATTAATTTGGGGGGAAAAAAA 

SEQ ID 12 

ATGGAGATTTTACATTGCTTAGGGCCAGATMTCAAGAMGT^ 

TACMCMTGGGGTC^GGAMTCACCATGCCCGCTCCATTATATAGCCCCACGAGTCAAAm 
ACTAGGGAAmTGAAGATGGCATTAMGTTCCAGTTGAGGCTAAMTAAATCMGAMGAGAAGGMTA^ 
SEQ ID 13 

MEILHCLGPDNQESTVQPMITSIPLNLWGRDLLQQWC^ 
SEQ ID 14 

WATIVGKRAKGPASGPTTNWGIPNSAICSSGFSGTTTPTVPSVSGNKPVTTIQQLSPATSGSAAVDLCTIQAVSLLPGEPPQKTPTGVYGPLPKGTV 

GLILGRSSLNLKGVQIHTSWDSDYKGEIQLVISSSIPWSASPR 

GKQFEGLVDTGADVSIIALNQWPKNWPKQKAVTGLVGIGTASE^ 

SQKIMTKMGYIPGKGLGKNEDGIKIPVEAKINQEREGIGNPC 

SEQ ID 15 

ATGG^TTAAMTTCCAGTTGAGGCT^^ 
ATACCATTMCTTGGAAMCAGAAAAACCA^ 

AGTTAGAAAAGGGTCATATTGAGCCTTCGTTCTCACCTTGGMTTCTC^ 
CTTMGGGCTGTAMCGCCGTMTTCMCCCATGGGGCCT^ 

ATTGATCTAAAGGATTGCTTTTTTACCATCCCTCTGGCAGAGCAGGATTGCGAAAAATTTGCCTrTACTATACCAGCCATAAATAATAAAGAACCAG 
CCACCAGGTTTCAGTGGAAAGTGTTACCTCAGGGAATGCTTMTAGTCCAACTATTTGTCAGACTTTTGTAGGTCGAGCTCTTCAACCAGTTAGAGA 
AAAGTTTTCAGACTGTTATATTATTCATTGTATTGATGATATTTTATGTGCTGCAGAAACGAMGATAMTTMTTGACTGTTATACATTTCTGCAA 

gcagaggttgccaatgctggactggcaatao^tctgataagatccamcctctactccttttcattatttagggatgca 
ttmgccacaaaaaatagmtmgaamgacacattaaamcactamtgattttcaaaaattactaggagatattm 
aggcattcctacttatgccatgtcamtttgttctctatcttaagaggagactcagacttamtagtaaaagaatgttmccccagaggcaacaam 
gamttamttagtggaagaaaaaattcagtcagc^ 

ctccmcaggcatcattattcaamtactgatcttgtggagtggtcattccttcctcacagtacagttaagacttttaca™ 

agctacattaatcggtcagacaagattacgaataataaaattatgtgggaatgacccagacaaaatagttgtccctttaaccaaggaacaagttaga 

cmgcctttatcaattctggtgcatggaagattggtc 

TAAMTTGACTACTTGGATTCTACCTAAAATTACCAGACGTGMCCTTTAGAAMTGCTCTMCAGTATTTACTGATGGTTCCAGCAATGGAAAAGC 
AGCTTACACAGGACCGAAAGMCGAGTMTCAAMCTCCATATCAATCGGCTCAAAGAGCAGAGTTGGTTGCAGTCATTACAGTGTTACAAGATTTT 
GACCAACCTATCMTAraATATCAGATTCTGCATATGTAGTACAGG 

TAAACCAGCTATTCAATTTATTACAACAAACTGTAAGAAAAAGAAATTTCCCATTTTATATTACACATATTCGAGCACACACTAATTTACCAGGGCC 
TTTGACTAAAGCAAATGAACAAGCTGACTTACTGGTATCATCTGCACTCATAAAAGCACAAGAACTTCATGCTTTGACTCATGTAAATGCAGCAGGA 
TTAAAAMCAAATTTGATGTCACATGGAAACAGGCAAMGATATTGTACAACA 

GAGTTAATCCCAGAGGTCTGTGTCCTMTGCATTATGGCAMTGGATGTCACGCATGTACCTTCATTTGGAAGATTATCATATGTTCACGTAACAGT 
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TGATiOTTATTCAraTTTCATATC^ 

GTTCCAGAAAAAATCAAMCTGACMTGGACCAGGATATTGTAGTAM 

ttccttataattcccaaggacaggcca^ 

gtgtaccactcctcagatgcmcttaatctag^^ 

cttactggtaaaaagaacagcccacatgaaggaamctmtttggtggaaagataataaam 

ggagaggttttgcttgtgtttcaccaggagaaaatc^gcttcctgtt^ 

amgaamgcacctccgcggagacggagacatcgcmtcgagc^^ 

gccatccaccmgaac&cagackcgccmcttg^ 

accccagagagtatgctgctt<kagccttgatgat^^ 

SEQ ID 16 

atgttaactgacttmgggctgtamcgccg™^ 

ctttaattatmttgatctamggattgcttttttaccatccctctggcagagcaggattgcgaaaaatttgcctttactataccagccataaataa 

taaagmcc^gccaccaggtttcagtggamgtgttacctcagggmtgcttmtagtccmctatttgtcagact^ 

ccagttagagaaaagttttcagactgttatattattcat^ 

catttctgcaagcagaggttgccmtgctggactggcmtagcatctgatmgatccaaacctctactccttttcattatttagg^ 
amtagaaaaattaagccacaaaaaatagamtmgaaaaga^^ 
cggcc^ctctaggcattcctacttatgc^^ 
aggcmovamgaaattaaattagtggmgaaaaaattcagtcagcgcaaa 
cactgcacattctccmcaggcatcattattc^ 
ttggatcamtagctacattmtcggtc^^^ 
mcmgttagac^gcctttatcmttctggtg^ 
• cttccagttcttaamttgactacttggattc^ 
mtggaamgcackttacaraggaccgamg 
tacmgattttgaccaacctatcaatattatatc^^ 
ggat(^tcagttamccagctattcaatttat^^ 
ttaccagggcctttgactamgcamtgmcaaotg^ 
atgcagcaggattaaaamcaaatttgatgtcacat^^ 
tcmgaggcaggagttmtcccagaggtctgtgtot 
cacgtmcagttgatacttattcacatttcatatgggcaact^ 

ctgtmtgggagttccagaaaaaatcaaaactgacaatggaccaggatattgtagtaaagctttccaaaaattcttaagtcagtggaaaatttcaca 
tacmcaggaattccttatmttcccmggacaggccatagttgamgaactaatagmcactcaamctcmttagttaaacaaaaagaaggggga 

GACAGTMGGAGTGTACCACTCCTCAGATGCAACTTMTCTAG^ 
CAGAACAACATCTTACTGGTAAAAAGMCAGCCCACAT^ 

GATAACGTGGGGGAGAGGTTTTGCTTGTGTTTCACCAGGAGAAAATCAGCTTCCTGTTTGGATACCCACTAGACATTTGAAGTTCTACAATGAACCC 
ATCAGAGATGCAMGAAAAGCACCTCCGCGGAGACGGAGACATCGCMTCGAGCACCGTTGACTCACAAGATGAACAAAATGGTGACGTCAGAAGM 
CAGATGAAGTTGCCATCCACCMGMGGCAGAGCCGCC^^ 

AGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCMTGGTGGTMGTCTCCCTATGCCTGC^GGAGCAGCTGCAGCTAA 
SEQ ID 17 

ATGGCATTAMGTCCCMCTGAGGCTGAAAAAMTCA^ 

CATTCCATTMTTTGGGGGGAAAAAAAAMCTGTATGGTAAAT^ 

CAGTTAGAAAAAGGACATATTGAGCCTTCATTTTCGCCTTGGMT^ 

TTAAGAGCCATTAATGCCATAATTCAACCCATGGGGGCTCTCCCATCCCGGTTGCCCTCTCCAGCCATGGTCCCCTTTAATTATAATTGATCTGAAG 
GATTGCTTTTTTACCOTCCTCTGGCAAM^^ 

GTGGAAAGTATTGCCTCAGGGAATGCTTAATAATTCAACTATTTGTCAGACTTTCATAGCTCAAGCTCTGCAACCAGTTAGAGACAAGTTTTCAGAC 

TGTTATATCGTTCATTATGTTGATATTTTGTGTGCTGCAGAAACGAGAGACAAATTAATTGACCGTTACACATTTCTCAGACAGAGGTTGCCAACGC 

GGGACTGACMTAGCATCTGATMGATTCAMCCTCTCCTCCTTTCCATTACTTGGGMTGCAGGTAGAGGAMGGAAMTTAAACCACAAAAAATA 

GAMTAAGAAMGACACATTAAAAACATTAAATGAGTTTCAAAAGTTGGTAGGAGATACTAATTGGATTCGGAGATATTAATTGGATTTGGCCAACT 

CTAGGCATTCCTACTTATGCCATGTCMTTTTGTTCTCTTTCTTMGAGGGGACTTGGMTTAAATAGTGAAAGMTGTTACCTCCAGAGGCAACTA 

AAGAAATTAMTTMTTGAAGAAAAAMTTCGGTCAGCACMGTAMTAGGATC^ 

TCTM(^GCCATCATTGTTC^AMCACAGATC^ 

GCTACATTAATTGGTCAGGGAAGATTACGAATAATAACATTGTGTGGAAATGACCCAGATAAAATCACTGTTCCTTTCAACAAGCAACAAGTTAGAC 
AAGCCTTTATCAGTTCTGGTGCATGGCAGATTGGTC^^ 

AAAATTGACTACTTGGATTCTACCTAAAATTACCAGACGTGAACCTTTAGAAAATGCTCTAACAGTATTTACTGATGGTTCCAGCAATGGAAAAGCG 
GCTTACACAGGGCCGAAAGAACGAGTAATCAAAACTCCGTATCAATCAGCTCAAAGAGCAGAGTTGGTTGCAGTCATTACAGTGTTACAAGATTTTG 
ACCAACCTATCAATATTATATCAGATTCTGCATATGTAGTACAGGCTACAAGGGATGTTGAGACAGCTCTAATTAAATATAGCACGGACGATCATTT 
mCCAGCTATTCMTTTATTACMCAMCTGTAAGAAAMGAMTTTCCCATTTTATATTACTCATAT 

TTGACTAAAGCAAATGAACAAGCTGACTTACTGGTATCATCTGCATTCATAAAAGCACAAGAACTTCTTGCTTTGACTCATGTAAATGCAGCAGGAT 
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TAAAAAACAAATTTGATGTCACATGGAAACAGGCAAMGATATTGTACMCATT^ 

AGTTAATCCCAGAGGTCTGTGTCCTAATGCGTTATGGCAAATGGATGGCACGCATGTTCCTTCATTTGGAAGATTATCATATGTTCATGTAACAGTT 

GATACTTATTCACATTTCATATGGGCMCTTGCCAMCAGGAGAAAGTACTTCCCATGTTAAAAAACATTTATTATCTTGTTTTGCTGTAATGGGAG 

TTCCAGAAAAMTCAAAACTGACMTGGACCAGGATATTGTAGTAAAGCTT^ 

TCCTTATAATTCCCMGGACAGGCCATAGTTGAAAGM^ 

TGTACCACTCCTCAGATGCAACTTAATCTAGCACTCTATACTTTAAATTTTTTAAACATTTATAGAAATCAGACTACTACTTCTGCAAAACAACATC 
TTACTGGTAAAMGCACAGCCCACATGMGGAAMCTMTTTGGTC^ 

GAGAGGTTTTGCTTGTGTTTCACCAGGAGAAAATCAGCTTCCTGTTTGGATACCCACTAGACATTTGAAGTTCTACAATGAACCCATCGGAGATGCA 
AAGAAAAGGGCCTCCACAGAGATGGTAACCCCAGTCACATGGATGGATAATC 

SEQ ID 18 

ATGGGGCCTCTCCMCCCGGGTTGCCCTCTCCGGCCATGATCCCAAAAGATTGGCCTTTAATTATAATTGATCTAAAGGATTGCTTTTTTACCATCC 

CTCTGGCAGAGCAGGATTGTGAAAMTTTGCCTTTACTATACCAGCCATJ^ 

GGGAATGCTTAATAGTCCAACTATTTGTCAGACTTTTGTAGGT^ 

ATTGATGATATTTTATGTGCTGCAGAMCGAAAGATAAATTMTTGACTGTTATACATTTCTGCM 
CATCCGATMGATCCAAACCTCTACTCCTTTTCATTATTTAGG^^ 

C^CATTAAAAACACTAAATGATTTTCAAAMTTACTAGGAGATATTAATTGGATTCGGCCMCTC^^ 

TTCTCTATCTTAAGAGGAGACTCAGACTTAAATAGTCAAAGAATATTMCCCCAGAGGCMCAAMGAMTTAAATTAGTGGAAGAAAAAATTCAGT 
CAGCGCAMTAAATAGMTAGATCCCTTAGCCCCACTC^ 

TCTTGTGGAGTGGTCATTCCTTCCTCACAGTAC^GTTMGACTTTTACATTGTACTTGGATCAMTAGCTACATTMTCGGTCAGAC 

ATMCAAMTTATGTGGAAATGACCCAGACAAAATAGTTGTCCCTTTAACCAAGGMCMGTTAGACAAGCCTTTATCAA 

TTGGTCTTGCTMTTTTGTGGGACTTATTGATMTCftT^ 

TACCAGACGTGMCCTTTAGAAMTGCTCTMCAGTAT^^ 

AAMCTCCATATCAATCGGCTCAAAGAGACGAGTTGG^ 

CATATGTAGTACAGGCTACAAGGGATGTTGAGACAGCTCTAATTAAATATAGCATGGATGATCAGTTAAACCAGCTATTCAATTTATTACAACAAAC 

TGTMGAAAMGAAATTTCCCATTTTATATTACTTATATTCGAG^ 

CTGGTATCATCTGCACTCATAAAAGCACMGMCTTCATGCTTTG 

AGGCAAMGATATTGTACMCATTGCACCCAGTGTCMGTCTTACACCTGC^ 

ATTATGGCAMTGGATGTCACGCATGTACCTTCATTTGGMGATTATCATATGTTCATGTAACAGTTGATACTTATTCACATTTCATATGGGCAACT 
TGCCAAACAGGAGAAAGTACTTCCCATGTTAAAAAACATTTATTGTCTTGTTTTGCTGTAATGGGAGTTCCA 

CAGGATATTGTAGTAAAGCTTTCCAAAAATTCTTAAGTCAGTGGAAMTTTCACATACAACAGGAATTCCTTATMTTCCCMGGACAGGCCATAGT 

TGAMGMCTAATAGAACACTCAAAACTCMTTAGTTAAACAAAMGM 

GCACTCTATACTTTAMTTTTTTAAACATTTATAGAAATC&^ 

GAAMCTMTTTGGTGGAAAGATMTAAAMTMGACATC 

AAATCAGCTTCCTGTTTGGTTACCCACTAGACATTTGMGTTCTACM 

CCAGTCACATGGATGGATAATCCTATAGMGTATATGTTAATGAM 

MGAAGGGATGATGATAMTATTTCCATTGGGTATCGTTATCCTCCTATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAGTCCAAAATTG 
GTTGGTAGAAGTACCTACTGTCAGTCCCATCAGTAGATTCACTTATCACATGGTAAGCGGGATGTCACTCAGGCCACGGGTAAATTATTTACAAGAC 
TTTTCTTATCAMGATCATTAAAATTTAGACCTAMGGGAM^ 

MGAATGTGTGGCCAATAGTGCGGTGATATTATAAMCAATGAATTTGGMCTATTATAGATTGGGCACCTCGAGGTCAATTCTACCACAATTGCTC 

AGGACAAACTCAGTCGTGTCCAAGTGCACAAGTGAGTCCAGCTGTTGATAGCGACTTAACAGAAAGTTTAGACAAACATAAGCATAAAAAATTGCAG 

TCTTTCTACCCTTGGGAATGGGGAGAAAAAGGMTCTCTACCCCMGACCAAAAATAGTAAGTCCTGTTTCTGGTCCTGAACATCCAGMTTATGG^ 

GGCTTACTGTGGCCTCACACCACATTAGMTTTGGTCTGGAAATCAMCTT^ 

CAGTCTMCAGTTCCTTTACAMGTTGCGTAMGCCCCCTTAT^ 

GAAAATTGTAGATTGCTTACTTGCATTGATTCAACTTTTAATTGGCAACACCGTATTCTGCTGGTGAGAGCAAGAGAGGGCGTGTGGATCCCTGTGT 
CCATGGACCGACCGTGGGAGGCCTCACCATCCGTCCATATmGACTGMGTATTAAMGGTGTTTTAMTAGATCCAAM 
AATTGCAGTGATTATGGGATTAATTGCAGTCACAGCTACGGCTGCTGTAGCAGGAGTTGCATTGCACTCTTCTGTTCAGTCAGTAAACTTTGTTAAT 
GATTGGCAAMGMTTCTACAAGATTGTGGAATTCACMTCTAGTATTGATCAAAMTTGGCAAATCAAATTMT 

GGATGGGAGACAGACTCATGAGCTTAGAACATCGTTTCCAGTTACAATGTGACTGGAATACGTCAGATTTTTGTATTACACCCCAAATTTATAATGA 

GTCTGAGCATCACTGGGACATGGTTAGACGCCATCTACAGGGAAGAGAAGATAATCTCACTTTAGACATTTCCAAATTAAAAGAACAAATTTTCGAA 

GCATCAAAAGCCCATTTAMTTTGGTGCCAGGAACTGAGGCAATTGCAGGAGTTGCTGATGGCCTCGCAMTCTTMCCCTGTC^ 

CCATTGGAAGTACATCGATTATAAATCTCATATTAATCCTTGTGTGCCTGTTTTGTCTGTTGTTAGTCTGCAGGTGTACCCAACAGCTCCGAAGAGA 

CAGCGACCATCGAGMCGGGCCATGATGACGATGGCGGTTTTGTCGAAMGAAM 

GTGTCTGTGTAG 

SEQ ID 19 

MLTDLRAVNAVIQPMGPLQPGLPSPMIPKDWPLIIIDLKDCFFTIPLAEQ 

PVREKFSDCYIIHGIDDILCMETKDKLIDCYTFLQAEVANAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIRKDTLKTLNDFQKLLGDIN^ 
RPTLGIPTYMSNLFSILRGDSDLNSKRMLTPEATKEIKLVEEKIQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLY 
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LDQIATLIGQTRLRIIKLCGNDPDKIWPLTK^ 
NGKAAYTGPKERVIKTPYQSAQRAELVAVITVLQDFDQPIMISDS^ 
LPGPLTKANEQADLLVSSALIKAQELHALTHVNM^ 
HVTVDTYSHFIWATCQTGESTSHVKKHLLSCFAVMGVPEKIKTD^^ 

DSKECTTPQMQLNLALYTLNFLNIYRNQTTTSAEQHLTGKKNSPHEGKIiIWWKDSKNKTWEIGKVITWGRGFACVS 

IRDAKKSTSAETETSQSSTVDSQDEQNGDVRRTDEVAIHQEGRAANLGTTKEADAVSYKISREHKGDTKPREYMCSLDDCINGGKSPYACRSSC 
SEQ ID 20 

MGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKF 
IDDILCAAETKDKLIDCYTFLQAEVANA&LAIASDKIQT^ 

FSILRGDSDLNSQRILTPEATKEIKLVEEKIQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRLR 
ITKLCGNDPDKIWPLTKEQVRQAFINSGAWQIGLAN^ 

KTP YQSAQRDELVAVI TVLQDFDQP INI I S DSAYWQATRDVETALI KYSMDDQLNQLFNLLQQTVRKRNFP FYI TYI RAHTNLPGPLTKANEQADL 

LVSSALIKAQELHALTHVNMGLKNKFDra 

CQTGESTSHVKmLSCFAVMGVPEKIKTDNGPGY 

ALYTLNFINIYRNQTTTSAEQHLTGKKNSPHEGKLI^ 

PVTWMDNPIEVYVNDSIWVPGPIDDRCPAKPEEEGMMINISIGYR 

FSYQRSLKFRPKGKPCPKEIPKESKNTEVLVWEECVANSAVILXNNEFGTIIDWAPRGQFYimCSGQTQSCPSAQVSPAVDSDLTESLDKHKHKKLQ 
SFYPWEWGEKGISTPRPKIVSPVSGPEHPELWRLTVASHHIRIWSGNQT^^ 

ENCRLLTCI DSTFNWQHRILLVRAREGVWI PVSMDRPW£AS PS VHILTEVLKGVLNRSKRFI FTLIAVIMGL IAVTATAAVAGVALHSSVQSVNFVN 

DWQKNSTRLWSQSSIDQKIANQINDLRQTVI^GDRL^ 

ASKAHLNLVPGTEAIAGVADGLANLNPVTWVmGSTSU^ 

VSV 

SEQ ID 21 

NKSRKRRNRESLLGMTVEPPKPI PLTWKTEKPV^ 

PMGPLQPGLPSPAMI PKDWPLI II DLKDCFFTI PLAEQDCEKFAFTI PAINNKEPATRFQWKVLPQGMLNSPTICQTFVGRALQPVREKFSDCY I IH 
CIDDILCAAETKDKLIDCYTFLQAEVMAGLAIASDKIQTSTPFHYLGMQIENRKIKPQKIEIEIKDTLKTLNDFQKLLGDI 
LFSILRGDSDLNSKRMLTPEATKEIKLVEEKIQSAQINRIDPLAPLQLLIFATAHSPTGIIIQNTDLVEWSFLPHSTVKTFTLYLDQIATLIGQTRL 
RIIKLCGNDPDKIWPLTKEQVRQAFINSGAWKIGLANFVGIIDM^ 

IKTPYQSAQRAELVAVITVLQDFDQPINIISDSAYWQATRDVETALIKYSMDDQLNQLFNLLQQTVRKRNFPFYITHIRAHTNLPGPM 

LLVSSALIKAQELHALTHVNMGLKNKFDVTMQA 

TCQTGESTSHVKKHLLSCFAVMGVPEKIKTDNGPGYCSKAFQKFLSQWKI^ 

LALYTLNFLN I YRNQTTTS AEQHLTGKKNS PHEGKLI WWKDNKNKTWE IGKV ITWGRGFACVS PGENQLPVW I PTRHLKFYNE P I RDAKKSTSAETE 

TSQSSTVDSQDEQNGDVRRTDEVAIHQEGRAMLGTTKEADAVSYKISREHKGDTNPREYAACSLDDCINGGKSPYACRSSCS 

SEQ ID 22 

ATGMCCCATCAGAGATGCAAAGAAAAGCACCTCCGCGGAGACGGAGACATCGCMTCGAGCACCGTTGACTCAC^ 
CAGMGMCAGATGMGTTGCCATCCACCAAGMGG^ 

GAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCAGGAGCAGCT 

GCAGCTAACTATACCTACTGGGCCTATGTGCCTTTCCCGCCCTTAATTCGGGCAGTCACATGGATGGATAATCCTACAGAAGTATATGTTAATGATA 

GTGTATGGGTACCTGGCCCCATA.GATGATCGCTGCCCTGCCAAAGCTGAGGAAGAAGGGATGATGATAAATATTTCCATTGGGTATCATTATCCTCC 

TATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAGTCCAAAATTGGTTGGTAGAAGTACCTACTGTCAGTCCCATCTGTAGATTCACTTAT 

CACATGGTMGCGGGATGTCACTCAGGCCACGGGTAMTTATTTACAAG 

GCCCCMGGAAATTCCCAAAGAATCAAAAAATACAGAAGTTTTAGTTT^ 

CGGMCTATTATAGATTGGGCACCTCGAGGTCMTTCTACCACMTTGCTCAGGACAAACTCAGTCGTGTCCAAGTGCACAAGTGAGTCCAGCTGTT 
GATAGCGACTTMCAGi^GTTTAGACAMC^TMGCATAAAAMTTGCAGTCTTTCTACCC^ 

GACCAAAMTAGTMGTCCTGTTTCTGGTCCTGMCATCCAGAATTATGGAGGCTTACTGTGGCCTCACACCACATTAGAATTTGGTCTGGAM 

AACTTTAGAAACAAGAGATCGTMGCCATTTTATACTATTGACCTGAATTCCAGTCTMCAGTTCCTTTACAMGTTGCGTAAAGCCCCCTTATATG 

CTAGTTGTAGGAAATATAGTTATTAAACCAGACTCCCAGACTATAACCTGTGAAAATTGTAGATTGCTTACTTGCATTGATTCAACTTTTAATTGGC 

MCACCGTATTCTGCTGGTGAGAGCMGAGAGGGCGTGTGGATCCCTGTGTCCATGGACCGACCGTGGGAGGCCTCGCCATCCGTCCATATTTTGAC 

TGAAGTATTAAMGGTGTTTTAMTAGATCCAAAAGATTCATTTTTACTTTAATTGCAGTGATTATGGGATTAATTGCAGTCACAGCTACGGCTGCT 

GTAGCAGGAGTTGCATTGCACTCTTCTGTTCAGTCAGTAAACTTTGTTAATGATTGGCAAAAAAATTCTACAAGATTGTGGAATTCACAATCTAGTA 

TTGATCAAAMTTGGCAAATCAAATTAATGATCTTAGACAAACTGTCATTTGGATGGGAGACAGACTCATGAGCTTAGAACATCGTTTCCAGTTACA 

ATGTGACTGGAATACGTCAGATTTTTGTATTACACCCCAAATTTATAATGAGTCTGAGCATCACTGGGACATGGTTAGACGCCATCTACAGGGAAGA 

GMGATAATCTC^CTTTAGACATTTCCAAATTAAAAGMCAMTTTTCGAAGCATCAAMGCCCATTTAMTTTGGTGC 

CAGGAGTTGCTGATGGCCTCGCAAATCTTAACCCTGTCACTTGGGTTMGACCATTGGAAGTACTACGATTATAAATCTCATATTAATCCTTGTGrc 
CCTGTTTTGTCTGTTGTTAGTCTGCAGGTGTACCCAACAGCTCCGAAGAGACAGCGACCA 
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SEQ ID 23 

ATGCAAAGAAAAGCACCTCCGCGGAC^CGGAGA^^ 
AGTTGCC^TCCACCMGAAGGCAGACKCGCCMOT 

ACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCAGGAGCAGCTGCAGCTAACTATACC 
TACTGGGCCTATGTGCCTTTCCCGCCCTTAATTCGGGCAGTCAC^^ 

GCCCCATAGATGATCGCTGCCCTGCCAAACCTGAGGAAGAAGGGATGATGATAAATATTTCCATTGGGTATCATTATCCTCCTATTTGCCTAGGGAG 

AGCACCAGGATGTTTAATGCCTGCAGTCCAAAATTGGTTGGTAGAAGTACCTACTGTCAGTCCCATCTGTAGATTCACTTATCACATGGTAAGCGGG 

ATGTCACTCAGGCCACGGGTAMTTATTTACMGACTTTT^ 

CCAAAGMTCAAAAAATACAGMGTTTTAGTTTC^ 

TTGGGCACCTCGAGGTCMTTCTACCA(MTTGCTCAGGACAAACTCAGTCGTGTCAMGTGCACAAGTGAGTCCAGCTGTTGA 
GAAAGTTTAGACAAACATAAGCATAAAAAATTGCAGTCTTTCTACrc 

GTCCTGTTTCTGGTCCTGAACATCCAGMTTATGGAGGCTTACTGTGGCCTCACACCACATTAGMTTTGGTC^ 

AGATCGTMGCCATTTTATACTATTGACCTGAATTCCAGTCTAACAGTTCCTTTACAAAGTTGCGTAAAGCCCCCTTATATGCTAGTTGTAGGAAAT 
ATAGTTAmMCCAGACTCCCAGACTATM^ 

TGGTGAGAGCAAGAGAGGGCGTGTGGATCCCTGTGTCCATGGACCGACCGTGGGAGGCCTCGCCATCCGTCCATATTTTGACTGMGTATTAAM^ 

TGTTTTAAATAGATCCAAAAGATTCATTTTTACTTTAATTGCAGTGATTATGGGATTAATTGCAGTCACAGCTACGGCTGCTGTAGCAGGAGTTGCA 

TTGCACTCTTCTGTTCAGTCAGTAMCTTTGTC^ 

CAMTCAAATTAATGATCTTAGACAAACTGTCATTTGGATGGGAGA 

GTCAGATTTTTGTATTACACCCCAAATTTATMTGAQTCTGAGCATCACTGGGACATGGTTAGACGCCATCTACAGGGMGAGAAGATAATCTCACT 
TTAGACAmCCAAATTAAMGAACAMTTTTCGMGm 

GCCTCGCAMTCTTMCCCTGTCACTTGGGTTMGACCATTGGAAGTACTACGATTATAMTCTCATATTAATCCTTGTGTGCCTGTTfTGTCTGTT 
GTTAGTCTGCAGGTGTACCCAACAGCTCCGAAGAGACAGCGACCATCGAGAACGGGCCATGATGACGATGGCGGTTTTGTCGAAAAGAAAAGGGGGA 
AATGTGGGGAAAAGCAAGAGAGATCAGATTGTTACTGTGTCTGTGTAG 

SEQ ID 24 

GTCACATGGATGGATAATCCTATAGAAGTATATGTTAATGATAGTGTATGGGTACCTGGCCCCACAGATGATCGCTGCCCTGCCAAACCTGAGGAAG 
MGGGATGATGATAMTATTTCCATTGTGTATCGTTATCCTCCTATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAGTCCAAAATTGGTT 

ggtagaagtacctactgtcagtcctaacagtagattcacttatcacatggtaagcgggatgtcactcaggccacgggtaaattatttacaagacttt 
tcttatcaaagatcattaamtttagacctaaagggaaaccttgccc^ 

mtgtgtggccmtagtgcggtgatattacaamcmtgaattcggaactattatagattgggcacctcgaggtcmttctaccacaattgctcagg 
acaaactcagtcgtgtccaagtgcacmgtgagtccagctgttgatagcgacttmcagaaagtctagacaaacatm 
ttctacccttgggaatggggagaaamggaatctctaccccmgaccagaaatmtaagtcctgtttctggtcctgaacatccagaattatggaggc 
ittggcctgacaccacattagaatttggtctg 

acggttcctttacamgttgcgtamgccctcttatatgctagttgtaggamtatagttattaaaccagactcccamctatm 
gtagattgtttacttgcattgattcmcm^^ 

ctgaccgtgggaggcctcgccatccatccatattttgactgmgtattaaaagacattttaaatagatccaaaagattcatttttaccttaattgca 
gtgattatgggattaattgcagtcacagctacggctgctgtggcaggagttgcattgcactcttctgttcagtcggtaaactttgttaatgattggc 
aamgaattctacaagattgtggmttcacmtctagtattgatcaa 

agacagactcatgagcttagmcattgtttccagttacagtgtgactggaatacgtcagatttttgtattacaccccaaatttataatg|agtctgag 

(^tcactgggacatggttagacgccatcta^ 

mgcccatttamtttgatgccaggaactgaggc^^ 

mgtactatgattataflatctcatattaatccttgtgtgcctgttttgtctgttgttagtctgcaggtgtacccaacagctccgaagagacagcgffc 
catcgagaacgggcca 

SEQ ID 25 

ATGGGGCCTCTCCAACCCGGGTTGCCCTCTCCGGCCATGATCCCAAMGATTGGCCTTTAATTATAATTGATCTAAAGGATTGCTTTTTTACCATCC 
CTCTGGCAGAGCAGGATTGTGAAAAATTTGCCTTTACTATACCAGCCATAAATAATAAAGAACCAGCCACCAGGTTTCAGTGGAAAGTGTTACCTCA 
GGGAATGCTTMTAGTCCMCTATTTGTCAGACTTTTGTAGGTCGAGCTCTTCAACCAGTGAGAGAAAAGTTTTCAGACTGTTATATTATTCATTAT 
ATTGATGATATTTTATGTGCTGCAGAAACGAAAGATAAATTAATTGACTGTTATACATTTCTGCAAGCAGAGGTTGCCAATGCTGGACTGGCAATAG 
CATCCGATMGATCCAAACCTCTACTCCTTTTC^^ 

CACATTAAAMC^CTAMTGATTTTCAAAAATTACTAGGAGATATTMTTGGATTCGGCCMCTCTAGGCATTCCTACTTATGCCATGTCAMTTTG 

TTCTCTATCTTAAGAGGAGACTCAGACTTAAATAGTCAAAGMTATTAAC^ 

CAGCGCAMTAMTAGMTAGATCCCTTAGCCCCACTtt^ 

TCTTGTGGAGTGGTCATTCCTTCCTCACAGTACAGTTAAGACTTTTACATTGTACTTGGATCAAATAGCTACATTAATCGGTCAGACAAGATTACGA 

ATAACAAAATTATGTGGAAATGACCCAGACAAAATAGTTGTCCCTTTAACCAAGGAACAAGTTAGACAAGCCTTTATCAATTCTGGTGCATGGCAGA 

TTGGTCTTGCTMTTTTGTGGGACTTATTGATMra^ 

TACCAGACGTGMCCTTTAGAAMTGCTCTMCAGM^ 

AAAACTCCATATCMTCGGCTCAMGAGACGAGTTGGTTGCAGTCATTACAGTGTTACMGATTTTGACCM 

CATATGTAGTACAGGCTACMGGGATGTTGAGACAGCTCTAATTAAATATAGCATGGATGATCAGTTAAACCAGCTATTCMTTTATTACAACAAAC 



Copy provided by USPTO from the PACR Image Database on 07/22/2003 



60*72I89*flS2003 

8/25 

TGTAAGMAAAGAAATTTCCCATTTTATATTACTTATATTCGAGCACACACTAATTTACCAGGGCCTTTGACTAAAGCAAATGAACAAGCTGACTTA 
CTGGTATCATCTGCACTCATAAAAGCACMGMOT 

.AGGCAAMGATATTGTACMCATTGCACCCAGTGTCAAGTCTTACACCTGCCCACTCMGAGGCAGGAGTTMTCCCAGAGGTCTGTGTCCTAATGC 
ATTATGGCAAATGGATGTCACGCATGTACCTTCATTTGGAAGATTATCATATGTTCATGTAACAGTTGATACTTATTCACATTTCATATGGGCAACT 
TGCCAAACAGGAGAMGTACTTCCCATGTTAAAAAACATTM 
CAGGATATTGIAGTAMGCTTTCCAAAMTTCTTMGTCAGTGGAAMT^ 

TGAAAGMCTMTAGMCACTCAAMCTCMTTAGTTAAACAAAMGMGGGGGAGACAGTAAGGAGTGTACCACTCCTC 
GCACTCTATACTTTAMTTTTTTAAACATTTATAGAAATCAGA 

GAAAACTMTTTGGTGGAMGATAATAAAAATAAGACATGGGAAATAGGGAAGGTGATAACGTGGGGGAGAGGTTTTGCTTGTGTTTCACCAGGAGA 
AMTCAGCTTCCTGTTTGGmCCCACTAGACATTTGAAGTTCTACAATGAACCCATCGGAGATGCAM 

CCAGTCACATGGATGGATAATCCTATAGAAGTATATGTTAATGATAGTATATGGGTACeTGGCCCCATAGATGATCGCTGCCCTGCCAAACCTGAGG 

MGAAGGGATGATGATAAATATTTCCATTGGGTATCGTTATCCTCCTATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAG 

GTTGGTAGAAGTACCTACTGTCAGTCCCATCAGTAGATTCACTTATCACATGGTMGCGGGATGTCACTCAGGCCACGGGTAAATTATTTA 

TTTTCTTATCAAAGATCATTAAAATTTAGACCTAAAGGGAAACCTTrc 

MGAATGTGTGGCCAATAGTGCGGTGATATTATAAMCAATGAATTTGGAA 

AGGACAMCTCAGTCGTGTCCAAGTGCACMGTGAGTCCAGCTGTXGATAGCGACTTMCAGAMGTTTAGACAAACATAAGCATAAAAAATT 

TCTTTCTACCCTTGGGAATGGGGAGAAAAAGGAATCTCTACCCCAAGACCAAAAATAGTMGTCCTGTTTCTGGTCCTGAACATCCAGAATTATG^ 

GGCTTACTGTGGCCTCACACCACATTAGAATTTGGTCTGGAAATCAAACTTTAGAAACAAGAGATTGTAAGCCATTTTATACTGTCGACCTAAATTC 

CAGTCTAACAGTTCCTTTACAAAGTTGCGTAAAGCCCCCTTATATGCTAGTTGTAGGAAATATAGTTATTAAACCAGACTCCCAGACTATAACCTGT 

GAAAATTGTAGATTGCTTACTTGCATTGATTCMCTTTTAATTGGCMCACCGTATTCT(XTGGTGAGAGCAAGAGAGGGCGTGTGGATCCCTGTGT 

CCATGGACCGACCGTGGGAGGCCTCACCATCCGTCCATATTTTGACTGAAGTATTAAAAGGTGTTTTAAATAGATCCAAAAGATTCATTTTTACTTT 

AATTGCAGTGftTTATGGGATTAATTGCAGTCACAGCTACGGCTK 

GATTGGCAAMGMTTCTACAAGATTGTGGAATTCACMTCTAGTATTGATCAAAAATTGGCAMT 

GGATGGGAGACAGACTCATGAGCTTAGAACATCGTTTCCAGTTACMTGTGACTGGMTACGTCAGATTTTTGTATTACACCC 

GTCTGAGCATCACTGGGACATGGTTAGACGCCATCTACAGGGMGAGAAGATM 

GCATCAAAAGCCCATTTAAATTTQGTGCCAGGAACTGAGGCAATTGCAGGAGTTGCTGATGGCCTCGCAAATCTTAACCCTGTCACTTGGGTTAAGA 

CCATTGGAAGTACATCGATTATAAATCTCATATTAATCCTTGTGTGCCTGTTTTGTCTGTTGTTAGTCTGCAGGTGTACCCAACAGCTCCGAAGAGA 

CAGCGACCATCGAGMCGGGCCATGATGACGATGGCGGTTTO^ 

GTGTCTGTGTAG 

SEQ ID 26 

HQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTWAQLKKLTQLATKYLENTKVT 
mWPFPPLII&VT^DNPTEVYVNDSVWVPGPIDDRC^ 

MSLRPRVNYLQDFSYQRSLKFRPKGKPCPKEIPKESKNTEVLWEECVANSAVILQNNEFGTIIDWAPRGQFYHNCSGQTQSCQS 

ESLDKHKHKKLQSFYPWEWGEKGISTPRPKIVSPVSGPEHPELWRL^ 

IVIKPDSQTITCENCRLLTCIDSTFNWQHRILLVRAREGV^ 

LHSSVQSVNFV^DWQKNSTRLWNSQSSIDQKLANQINDLRQ 

LDISKLKEQIFEASKAHLNLVPGTEAIAGVADGL^^ 

NVGKSKRDQIVTVSV 

SEQ ID 27 

mgpiiqpglpspamipkdwpliiidlkdcfftiplaeqdcekfaftipainn 
iddii£aaetkdklidcytflqaevmaglaiasdki^^ 

fsilrgdsdlnsqriltpeatkeiklveekiqsaqinridplaplqllifatahsptgiiiqntdlvewsflphstvktftlyldqiatligqtrlr 
itklcgndpdkiwpltkeqvrqafinsgawqiglmfvglidm 

ktpyqsaqrdelvavitvlqdfdqpiniisdsayvvqatrdvetalikysmddqlnqlfnllqqtvrkrnfpfyityirahtnlpgpltkaneqa 

lvssalikaqelmthvnmglknkfdvtwkqakdivqhctqcqvlhlptqeagvnprglcpnalwqmdvthvpsfgrlsyvhvtvdtyshfiwat 

cqtgestshvkkhllscfavmgvpekiktdngpgycskafqkflsqwkishttgipynsqgqaivertnrtlktqlvkqkeggdskecttpqmqlnii 

alytlnflniyrnqtttsaeqhltgkknsphegkliwwkdnknktweigkv™grgfacvspgenqlpvwlpt^^ 

pwwmdnpievyvndsiwvpgpiddrcpakpeeegmminisigyryppiclgrapgc^^ 

fsyqrslkfrpkgkpcpkeipkeskntevlvweecvansavilxnnefgtiidwaprgqfyhncsgqtqscpsaqvspavdsdltesldkhkhkklq 
sfypwewgekgistprpkivspvsgpehpelwrltvashhiriwsgnqt^ 

encrlltcidstfnwqhrillvraregvwipvsmdrpweaspsvhiltevlkgvlnrskrfiftliavimgliavtataavagvalhssv 

dwqknstrlwnsqssidqklanqindlrqtviwmgdrlmsleh^ 

askahlnlvpgteaiagvadgianlnpvtwvktigst^ 

VSV 

SEQ ID 28 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPT^^ 
MNYTYWAYVPFPPLIRAVTWMDNPTEVYVNDSWVPGPIDDRCPAKPEEEGM^ 
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HWSGMSLRPRVNYLQDFSYQRSLKFRPKGKPCPKEIPKESKNTEVI.VWEECVANSAVILQNNEFGTIIDWAPRGQEYHNCSGQTQSCPSAQVSPAV 

DSDLTESLDKHKHKKLQSFYPWEWGEKGISTPRPKIVSPVSGPEHPELWRLTVASHHIRIWSGNQTI.ETRDRKPFYTIDLNSSLTVPLQSCVKPPY^ 

LWGNIVIKPDSQTITCENCRLLTCIDSTFNWQHRILLVRAREGVWIPVSMDRPWEASPSVHILTEVLKGVlNRSKRFim 

VAGVALHSSVQSVNFVNDWQKNSTRL?TOSQSSID^ 

EDNLTLDISKLKEQimSKAHLNLVPGTEAIAGVAD^^ 

KRKGGNVGKSKRDQIVTVSV 

SEQ ID 29 

AGTTCTACAATGAACCCAT^GAGATGCAAAGAA^ 

TGGTGACGTCAGAAGAACAGATGMGTTGCCATCCACCMGMGGCAGAGCCGCCMCTTGGGCACMCTAAAGMGCTGACGCAGTTAGCTACAM 
ATATCTAGAGAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCA 
GGA 

SEQ ID 30 

TCTGCAGGTGTACCCAACAGCTCCGAAGAGACAGCGACCATCGAGAACGGGCCATGA 
SEQ ID 31 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKl^ 
TATIENGP 

SEQ ID 32 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTWAQLKKLTQLATKYLENTKSAGVPNSSE 
SEQ ID 33 

MNPSEMQRKGPPQRCLQVYPTAPKRQRPSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 
SEQ ID 34 

MNSLEMQRKVWRff RHPNRIiASLQVYPAAPKRQQPARMGHS DDGGFVKKKRGGYVRKRE I RLSLCLCRKGRHKKLHFVLY 
SEQ ID 35 

IWPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTK 
KRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 

SEQ ID 36 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTWAQLKKLTQLATKYI^ 
CGEKQERSDCYCVCVERSRHRRLHFVLY 

SEQ ID 37 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTK 
PSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 

, SEQ ID 38 

MNPSEMQRKGPPQRCLQVYPTAPKRQRPSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 
SEQ ID 39 

MNPSEMQRKGPPQRCLQVYPTAPKRQRPSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 
SEQ ID 40 

MEYKNRHLKFYNEPIGDAKKRASTEMSAGVPNSSEETATIENGP 
SEQ ID 41 

MNPSEMQRKGPPQRCLQVYPTAPKRQRPSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVLY 
SEQ ID 42 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTWAQLKKLTQLATKYLENTKSAGVPNSSEETATIENGP 
SEQ ID 43 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTW^^ 
TATIENGP 

SEQ ID 44 

MVTPVTWMDNPIEVYVNDSVWVPGPTDDRCPAKPEEEGMMINISIVYRYPPICLGRAPGCLMPAVQNCLQVYPTAPKRQRPSRTGHDDDGGFVEKKR 

GKCGEKQERSDCYCVCVERSRHRRLHFVLY 

SEQ ID 45 

MVTPVTWMDNPIEVYVNDSEWVPGPTDDRCPAKPEEEGMMINISIGLQ^ 
RRLHFVMC 

SEQ ID 46 

HNSLtMQRKVWRWRHPNRIASLQVYPAAPKRQQPARMGHSDDGGFVKKKRGGWRKRE 
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SEQ ID 47 

MNSLEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLSSTKKAEPPT^^ 

MNYTYWAYVPFPPLIRAVTWMDNPTEVYVNDSVWVPGPIDDRCPMPEEEGMMINISIGYHYPPICLGRAPGCLMPAVQNWLVE 
HMSAGVPNSSEETATIENGP 

SEQ ID 48 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMW^ 
KRGKCGEKQERSDCYCVCVERSRHRRLHFVMY 

SEQ ID 49 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKL^ 
CGEKQERSDCYCVCVERSRHRRLHFVMY 

SEQ ID 50 

MNPSEMQRKAPPRRRRHRNRAPLTHKMNKMVTSEEQMKLPSTKKAEPPTWAQLKKLTQLATKYX^ 
RPSRTGHDDDGGFVEKKRGKCGEKQERSDCYCVCVERSRHRRLHFVMY 

SEQ ID 51 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCMCGACCCCCGCCCATTGACGTCMTMTGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCA 
ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCMGTCCGCCCCCTATTGACGTCMTGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGG&U-nTlCACCMTGGGCGTGGATAGCGGTTTGAC^ 

AAATCAACGGGACTTTCCAAAATGTCGTMTAACCCCGCCCCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT 
CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 
GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 
GTTTTTGGCTTGGGGCCTATACACCCCCGCTTCCTTATGCTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATTGACCAC 
TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTATCTCTATTGGCTATATGCCAATACTCTGTCCTTCAG 
AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTCACATATACAACAACGCCGTCCCCCGTGCCCGCAGTTTT 
TATTAAACATAGCGTGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 
GGTCCCATGCCTCCAGCGGCTCATGGTCGCTCGGCAGCTCCTTGCTCCTA&CAGTGGAGGCCAGACTTAGGCACAGCACAATGCCCACCACCACCAG 
TGTGCCGCACAAGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGAGCTCGGAGATTGGGCTCGCACCGCTGACGCAGATGGAAGACTTAAGGCAGCG 
GCAGAAGAAGATGCAGGCAGCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCT 
GAGC^GTACTCGTTGCTGCCGCGCGCGCCACCAGACATMTAGCTGACAGACTMCAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 
tcgacgccaccatgaaccccagcgagatgcagcgcaaggcccccccccgccgccgccgccaccgcaaccgcgcccccctgacccacaagatgaacaa 
gatggtgaccagcgaggagcagatgaagctgcccagcaccaagaaggccgagccccccacctgggcccagctgaagaagctgacccagctggccacc 
aagtacctggagaacaccaaggtgacccagacccccgagagcatgctgctggccgccctgatgatcgtgagcatggtgagcgccggcgtgcccaaca 
gcagcgaggagaccgccaccatcgagaacggccccgcttaaagaattcAGACTCGAGCAAGTCTAGAAAGCCATGGATATCGGATCCACTACGCGTT 
AGAGCTCGCTGATCAGCCTCGACTGTGCCTTCM 

CCACTGTCCTTTCCTMTAAAATGAGGAMTTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGG 

GGAGGATTGGGAAGACAATAGCAGGGGGGTGGGCGAAGAACTCCAGCATGAGATCCCCGCGCTGGAGGATCATCCAGCCGGCGTCCCGGAAAACGAT 

TCCGAAGCCCAACCTTTCATAGMGGCGGCGGTGGAATC.GAAATCTCGTGATGGCAGGTTGGGCGTCGCTTGGTCGGTCATTTCGM 

CCGCTCAGAAGAACTCGTCAAGAAGGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAAAGCACGAGGAAGCGGTCAGCCCATTC 

GCCGCCAAGCTCTTCAGCAATATCACGGGTAGCCAACGCTATGTCCTGATAGCGGTCCGCCACACCCAGCCGGCCACAGTCGATGAATCCAGAAAA^ 

CGGCCATTTTCCACCATGATATTCGGCAAGCAGGCATCGCCATGGGTCACGACGAGATCCTCGCCGTCGGGCATGCGCGCCTTGAGCCTGGCGAACA 

GTTCGGCTGGCGCGAGCCCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGTGCTCGCTCGATGCGATGTTT 

CGCTTGGTGGTCGAATGGGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGATGGATACTTTCTCGGCAGGAGCAAGGTGA 

GATGACAGGAGATCCTGCCCCGGCACTTCGCCCMTAGCAGCCAGTCCCTTCCCGCTTCAGTGACAACGTCGAGCACAGCTGCGCAAGGAACGCCCG 

TCGTGGCCAGCCACGATAGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCACCGGACAGGTCGGTCTTGACAAAAAGM 

TGACAGCCGGAACACGGCGGCATCAGAGCAGCCGATTGTCTGTTGTGCCCAGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAGAACCTGCG 

TGCMTCCATCTTGTTCMTCATGCGAMCGATCCTCATCCTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGGCAAGAAAG 

CCATCCAGTTTACTTTGCAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTGGCAATTCCGGTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAG 

CTATCGCCATGTMGCCCACTGCMGCTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGTCCAGATAGCCCAGTAGCTGACATTCATCCGGGGT 

CAGCACCGTTTCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTGAAGCTAATTCATGGT 

TAAATTTTTGTTAAATCAGCTCATTTTTTMCCMTAGGC^ 

TGTTCCAGTTTGGMCMGAGTCCACTATTAAAGAA^ 

TGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCG 
GCTGCGGCGAGCGGTATCAGCTCACTCAMGGCGGTMTACGGTTATCCACAGMTCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAG 
CAAMGGCCAGGAACCGTAAAMGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCM 
GGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATA 
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:gcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggc 
tgtgtgcacgaaccccccgttcagcccgarc^ 
cagcagccactggtaacaggactagcagagcgaggtatgtaggcggt(^ 
agtatttggtatctgcgctctgctgmgccagttaccttc^^ 
ttttttgtttgcaagcagcagattacgcgcagaaaaamg^ 

G 

SEQ ID 52 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATC 

CATTGATTATTGACTAGTTATTMTAGTMTCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCA 
ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCGTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGGCAGTACACCMTGGGCGTGGATAGCGGTTTGACTCACGGGGATTTC 

AMTCAACGGGACTTTCCAAMTGTCGTAATAACCCCGCCCCGTTGACGCAAATGGGCGGTAGQCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT 
CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 
GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 
GmTTGSCTTGQBQCCTATRCWXCOOGCTTC^ 

TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATMCATGGCTCTTTGCCACMCTATCTCTATTGGCW 

AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTCACATATACAAC7^ACGCCGTCCCCCGTGCCCGCAGTTTT 
TATTAAACATAGCGTGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 
GGTCCCATGCCTCCAGCGGCTCATGGTCGCTCGGCAGCTCCTTGCTCCTAACAGTGGAGGCCAGACTTAGGCACAGCACAATGCCCACCACCACCAG 
TGTGCCGeACMGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGA^ 

G(^GMGMGATGCAGG(^GCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAA^ 

GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 
tcgacgccaccatgaaccccagcgagatgcagcgcaaggcccccccccgccgccgccgccaccgcaaccgcgcccccctgacccacaagatgaacaa 
gatggtgaccagcgaggagcagatgaagctgcccagcaccaagaaggccgagccccccacctgggcccagctgaagaagctgacccagctggccacc 
aagtacctggagaacaccaaggtgacccagacccccgagagcatgctgctggccgccctgatgatcgtgagcatggtggtgtaccccaccgccccca 
agcgccagcgccccagccgcaccggccacgacgacgacggcggcttcgtggagaagaagcgcggcaagtgcggcgagaagcaggagcgcagcgactg 
ctactgcgtgtgcgtggagcgcagccgccaccgccgcctgcacttcgtgctgtacgcttaaagaattcAGACTCGAGCAAGTCTAGAAAGCCATGGA 
TATCGGATCCACTACGCGTTAGAGCTCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTT 
GACCCTGGMGGTGCCACTCCCACTGTCCTTTCCTMTAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGG 
GTGGGGCAGGACAGCMGGGGGAGGATTGGGAAGACAATAGCAGGGGGGTGGGCGMGAACTC 

CCGGCGTCCCGGAAAACGATTCCGMGCCCMCCTTTCATAGAAGGCGGCGGTGGMTCGAMTCTCGTGATGGCAGGTTGGGCGTCGCTTGGTCGG 
TCATTTCGMCCCCAGAGTCCCGCTCAGMGMCTCGTCAAGAAGGCGATAGAAGGCGATGCGCTGCGMTCGGGAGCGGCGATACCGTAAAGCACG 
AGGMGCGGTCAGCCCATTCGCCGCCAAGCTCTTCAGCAATATC^ 

AGTCGATGAATCCAGAAAAGCGGCCATTTTCCACCATGATATTCGGCAAGCAGGCATCGCCATGGGTCACGACGAGATCCTCGCCGTCGGGCATGCG 

CGCCTTGAGCCTGGCGAACAGTTCGGCTGGCGCGAGCCCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGT 

GCTCGCTCGATGCGATGTTTCGCTTGGTGGTCGAATGGGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGATGGATACTT 

TCTCGGCAGGAGCMGGTGAGATGACAGGAGATCCTGCCCCGGCACTTCGCCCMTAGCAGCCAGTCCCTTCCCGCTTCAGTGACMCGTCGAGCAC 

AGCTGCGCAAGGMCGCCCGTCGTGGCCAGCCACGATAGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCACCGGACAGGTCGGTCTTGACA 

AGAACCGGGCGCCCCTGCGCTGACAGCCGGAACACGGCGGCATCAGAGCAGCCGATTGTCTGTTGTGCCCAGTCATAGCCGAATAGCCTCTCCACCC 

AAGCGGCCGGAGAACCTGCGTGCAATCCATCTTGTTCAATCATGCGAAACGATCCTCATCCTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCA 

GATCCTTGGCGGCMGAMGCCATCCAGTTTACTTTGCAGGGCTTCCCMCCTTACCAGAGGGCGCCCCAGCTGGCMTTCCGGTTCGCTTGCT 

CATAAMCCGCCCAGTCTAGCTATCGCCATGTMGCC^ 

AGCTGACATTCATCCGGGGTCAGCACCGTTTCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCA 
GCGTGAAGCTMTTCATGGTTAMTTTTTGTTAAATC^^ 

CCCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGG 

GCGATGGCCGGATCAGCTTATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGAC 

TCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTMTACGGTTATCCACAGAATCAGGGGATMCGCAGGAM 

ACATGTGAGCAAMGGCCAGCAAMGGCCAGGMCCGTAAAMGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACM 

AAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCG 

ACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGG 

TCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAG 

ACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 

TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAA 

CC^CCGCTGGTAGCGGTGGTTTTTTTGTTTGCMGCAGCAGATTACGCGCAGAAAAAMGGATCTCMGMGATCCTTTGATCTTTTCTACTGMCG 
GTGATCCCCACCGGAATTGCG 
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SEQ ID 53 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCMTATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCA 
ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGGCAGTACACCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCMGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 
AMTCMCGGGACTTTCCAAMTGTCGTMTMCCW 

CGTTTAGTGMCCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 

GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 

GTTTTTGGCTTGGGGCCTATACACCCCCGCTTCCTTATGCTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATTGACCAC 

TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATMC^^ 

AGAC^CACGGACTCTGTATTTTTACAGGATG^ 

TATTAAACATAGCGXGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 

GGTCCCATGCCTCCAGCGGCTCATGGTCGCTO 

TGTGCCGCACMGGCCGTGGCGGTAGGGTATGTGTC^^ 

GCAGAAGAAGATGCAGGCAGCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCT 
GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 
tcgacgccaccatggggcaaactgaaagtaaatatgcctcttatctcagctttattaaaattcttttaagaagagggggagttagagcttctacaga 
aaatctaattacgctatttcaaacaatagaacaattctgcccatggtttccagaacagggaactttagatctaaaagattgggaaaaaattggcaaa 
gaattaaaacaagcaaatagggaaggtaaaatcatcccacttacagtatggaatgattgggccattatta.aagcaactttagaaccatttcaaacag 
gagaagatattgtttcagtttctgatgcccctaaaagctgtgtaacagattgtgaagaagaggcagggacagaatcccagcaaggaacggaaagttc 
aMttgtaaatatgtagcagagtctgtaatggctcagtcaacgcaaaatgttgactacagtcaattacaggagataatataccctgaatcatcaaaa 
ttgggggaaggaggtccagaatcattggggccatcagagcctaaaccacgatcgccatcaactcctcctcccgtggttcagatgcctgtaacattac 
aacctcaaacgcaggttagacaagcacaaaccccaagagaaaatcaagtagaaagggacagagtctctatcccggcaatgccaactcagatacagta 
tccacaatatcagccggtagaaaataagacccaaccgctggtagtttatcaataccggctgccaaccgagcttcagtatcggcctccttcagaggtt 
caatacagacctcaagcggtgtgtcctgtgccaaatagcacggcaccataccagcaacccacagcgatggcgtctaattcaccagcaacacaggacg 
cggcgctgtatcctcagccgcccactgtgagacttaatcctacagcatcacgtagtggacagggtggtgcactgcatgcagtcattgatgaagccag 
aaaacagggcgatcttgaggcatggcggttcctggtaattttacaactggtacaggccggggaagagactcaagtaggagcgcctgcccgagctgag 
actagatgtgaacctttcaccatgaaaatgttaaaagatataaaggaaggagttaaacaatatggatccaactccccttatataagaacattattag 
attccattgctcatggaaatagacttactccttatgactgggaaattttggccaaatcttccctttcatcctctcagtatctacagtttaaaacctg 
gtggattgatggagtacaagaacaggtacgaaaaaatcaggctactaagcccactgttaatatagacgcagaccaattgttaggaacaggtccaaat 
tggagcaccattaaccaacaatcagtgatgcagaatgaggctattgaacaagtaagggctatttgcctcagggcctggggaaaaattcaggacccag 
gaacagctttccctattaattcaattagacaaggctctaaagagccatatcctgactttgtggcaagattacaagatgctgctcaaaagtctattac 
agatgacaatgcccgaaaagttattgtagaattaatggcctat'gaaaatgcaaatccagaatgtcagtcggccataaagccattaaaaggaaaagtt 
ccagcaggagttgatgtaattacagaatatgtgaaggcttgtgatgggattggaggagctatgcataaggcaatgctaatggctcaagcaatgaggg 
ggctcactctaggaggacaagttagaacatttgggaaaaaatgttafaattgtggtcaaatcggtcatctgaaaaggagttgcccagtcttaaataa 
acagaatataataaatcaagctattacagcaaaaaataaaaagccatctggcctgtgtccaaaatgtggaaaaggaaaacattgggccaatcaatgt 
cattctaaatttgataaggatgggcaaccattgtcgggaaacaggaagaggggccagcctcaggccccccaacaaactggggcattcccagttcaac 
tgtttgttcctcagggttttcaaggacaacaacccctacagaaaataccaccacttcagggagtcagccaattacaacaatccaacagctgtcccgc 
gccacagcaggcagcaccgcagtaagaattcagactcgagcaagtctagaAAGCCATGGATATCGGATCCACTACGCGTTAGAGCTCGCTGATCAGC 
CTCGACTGTGCCTTCTAGTTGCCAGCCATCT^ 

TAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACA 
ATAGCAGGGGGGTGGGCGAAGMCTCCAGCATGAGATCCCCGCGCTGGAGGATCATC^ 

-CATAGMGGCGGCGGTGGAATCGAMTCTCGTGATGGCAGGTTGGGCGTCGCTTGGTCGGTCATTTCGAACCCCAGAGTCCCGCTCAGAAG^ 
TCMGMGGCGATAGMGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAAAGCACGAGGAAGCGGTCAGCCCATTCGCCGCCAAGCTCTTCAG 
CAATATCACGGGTAGCCAACGCTATGTCCTGATAGCGGTCCGCCACACCCAGCCGGCCACAGTCGATGAATCCAGAAAAGCGGCCATTTTCCACCAT 
GATATTCGGCAAGCAGGCATCGCCATGGGTCACGACGAGATCCTGGCCGTCGGGCATGCGCGCCTTGAGCCTGGCGAACAGTTCGGCTGGCGCGAGC 
CCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGTGCTCGCTCGATGCGATGTTTCGCTTGGTGGTCGAATG 
GGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGATGGATACTTTCTCGGCAGGAGCAAGGTGAGATGACAGGAGATCCTG 
CCCCGGCACTTCGCCCAATAGCAGCCAGTCCCTTCCCGCTTCAGTGACAACGTCGAGCACAGCTGCGCAAGGAACGCCCGTCGTGGCCAGCCACGAT 
AGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCACCGGACAGGTCGGTCTTGACAAAAAGAACCGGGCGCCCCTGCGCTGACAGCCGGAACACGG 
CGGCATCAGAGCAGCCGATTGTCTGTTGTGCCCAGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAGAACCTGCGTGCAATCCATCTTGTTC 
AATCATGCGAAACGATCCTCATCCTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGGCAAGAAAGCCATCCAGTTTACTTTG 
CAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTGGCAATTCCGGTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAGCTATCGCCATGTAAGCC 
CACTGCAAGCTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGTCCAGATAGCCCAGTAGCTGACATTCATCCGGGGTCAGCACCGTTTCTGCGG 
ACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTGAAGCTAATTCATGGTTAAATTTTTGTTAAATC 
AGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGCCCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACA 
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AGAGTCCACTATTAMGMCGTGGACTCCMCGTCAMGGGCGAAAAACCGTC^ 

CACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTAT 
CAGCTCACTCAMGGCGGTMTACGGm 

TAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACA 
GGACTATAAAGATACCAGGCGTTTCCCCCTGGMGCTCCCTCGT^ 

CTTCGGGMGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCM 

CGTTCAGCCCGACCGCTGCGCCTTATCCGGTMCTATCGTCTTGAGTCCMCCCGGTMGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAAC 
AGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCG 
CTCTGCTGMGCC^GraCCTTCGGAAAAAGAGTTG^ 

GCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACTGAACGGTGATCCCCACCGGAATTGCG 
SEQ ID 54 

GCCGCGGMTTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTMTAGTAAT(MTTACGGGGTCATTAGTTCATAGCCCATATATGGAGTrcCGCG 

GCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCA 

ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCMGTGTATCATATGCCMGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 

CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 

TTTGGCAGTACACCMTGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTC^ 

AMTCAACGGGACTTTCCAAMTGTCGTMTMCCCCGCCCCGTTGACGCAMTGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATM 

CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 

GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 

Q T ^Y^YGGCTTGGGGCCTATACACCCCCGCTTCCTTATGCTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATTGACCAC 

TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTATCTCTATTGGCTATATGCCAATACTCTGTCCTTCAG 

AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTCACATATACAACAACGCCGTCCCCCGTGCCCGCAGTTTT 

TATTAAAC^TAGCGTGGGATCTCCACGCGMTCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 

GGTCCCATC&CTO^GCGGCTCATGGTCGC^^ 

TGTGCCGCACAAGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGAGCTCGGAGATTGGGCTCGCACCGCTGACGCAGATGGAAGACTTAAGGCAGCG 
GCAGAAGAAGATGCAGGCAGCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCT 
GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 
tcgacgccaccatgggccagaccaagagcaagatcaagagcaagtacgccagctacctgagcttcatcaagatcctgctgaagcgcggcggcgtgaa 
ggtgagcaccaagaacctgatcaagctgttccagatcatcgagcagttctgcccctggttccccgagcagggcaccctggacctgaaggactggaag 
cgcatcggcaaggagctgaagcaggccggccgcaagggcaacatcatccccctgaccgtgtggaacgactgggccatcatcaaggccgccctggagc 
ccttccagaccgaggaggacagcgtgagcgtgagcgacgcccccggcagctgcatcatcgactgcaacgagaacacccgcaagaagagccagaagga 
gaccgagggcctgcactgcgagtacgtggccgagcccgtgatggcccagagcacccagaacgtggactacaaccagctgcaggaggtgatctacccc 
gagaccctgaagctggagggcaagggccccgagctggtgggccccagcgagagcaagccccgcggcaccagccccctgcccgccggccaggtgcccg 
tgaccctgcagccccagaagcaggtgaaggagaacaagacccagccccccgtggcctaccagtactggccccccgccgagctgcagtaccgcccccc 
ccccgagagccagtacggctaccccggcatgccccccgccccccagggccgcgccccctacccccagccccccacccgccgcctgaaccccaccgcc 
ccccccagccgccagggcagcaagctgcacgagatcatcgacaagagccgcaaggagggcgacaccgaggcctggcagttccccgtgaccctggagc 
ccatgccccccggcgagggcgcccaggagggcgagccccccaccgtggaggcccgctacaagagcttcagcatcaagaagctgaaggacatgaagga 
gggcgtgaagcagtacggccccaacagcccctacatgcgcaccctgctggacagcatcgcccacggccaccgcctgatcccctacgactgggagatc 
ctggccaagagcagcctgagccccagccagttcctgcagttcaagacctggtggatcgacggcgtgcaggagcaggtgcgccgcaaccgcgccgcca 
acccccccgtgaacatcgacgccgaccagctgctgggcatcggccagaactggagcaccatcagccagcaggccctgatgcagaacgaggccatcga 
gcaggtgcgcgccatctgcctgcgcgcctgggagaagatccaggaccccggcagcacctgccccagcttcaacaccgtgcgccagggcagcaaggag 
ccctaccccgacttcgtggcccgcctgcaggacgtggcccagaagagcatcgccgacgagaaggcccgcaaggtgatcgtggagctgatggcctacg 
agaacgccaaccccgagtgccagagcgccatcaagcccctgaagggcaaggtgcccgccggcagcgacgtgatcagcgagtacgtgaaggcctgcga 
cggcatcggcggcgccatgcacaaggccatgctgatggcccaggccatcaccggcgtggtgctgggcggccaggtgcgcaccttcggccgcaagtgc 
tacaactgcggccagatcggccacctgaagaagaactgccccgtgctgaacaagcagaacatcaccatccaggccaccaccaccggccgcgagcccc 
ccgacctgtgcccccgctgcaagaagggcaagcactgggccagccagtgccgcagcaagttcgacaagaacggccagcccctgagcggcaacgagca 
gcgcggccagccccaggccccccagcagaccggcgccttccccatccagcccttcgtgccccagggcttccagggccagcagccccccctgagccag 
gtgttccagggcatcagccagctgccccagtacaacaactgcccccccccccaggccgccgtgcagcaggcttaaagaattcAGACTCGAGCAAGTC 

tagaaagccatggatatcggatccactacgcgttagagctcgctgatcagcctcgactgtgccxtctagttgccagccatctgttgtttgcccctcc 

cccgtgccttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattcta 

ttctggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggggggtgggcgaagaactccagcatgagatccccgcgct 

ggaggatcatccagccggcgtcccggaaaacgattccgaagcccaacctttcatagaaggcggcggtggaatcgaaatctcgtgatggcaggttggg 

cgtcgcttggtcggtcatttcgaaccccagagtcccgctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcga 

taccgtaaa(k:acgaggaagcggtcagcccattcgccgccmgctcttcagcmtatcacgggtagccmcg 

acccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgagatcctcg 

ccgtcgggcat(kgc(£cttgagcctggcgmcagttc^ 

ccatccgagtacgtgctcgctcgatgcgatgm^ 
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CATGATG(MACTTTCTOGGCAGGAGCAAGGTG^ 
ACAACGTCGAGCACAGCTGCGCMGGAACGCCCGTCGTGGCCAGCCACGATAGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCflCCGGACAGGT 

CGGTCTTGACAAAAAGAACCGGGCGCCCCTGCGCTGACAGCCGGAACACGGCGGCATCAGAGCAGCCGATTGTCTGTTGTGCCCAGTCATAGCCGAA 
TAGCCTCTCCACCC^GCGGCCGGAGAACCTGCGTGCAATCCATCTTGTTC^ 

CCCCTGCGCCATCAGATCCTTGGCGGCAAGAMGCCATCCAGTTTACTTTGCAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTGGCAATTCCG 

GTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAGCTATCGCCATGTAAGCCCACTGCAAGCTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGT 

CCAGATAGCCCAGTAGCTGAWTTCATCCGGGGTCAGCACCGTTTCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCT 

GAGTGCTTGCGGCAGCGTGMGCTMTTCATGGTTAAATTTTTGTTAAATC^GCTCATTTTTTMCCAATAGGCCGAAATCGGCAAMTCCCT 

MTCAAMGMTAGCCCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGMCMGAGTCCACTATTAAAGMCGTGGACTCCAACGTCAMGGGCGAAA 

MCCGTCTATCAGGGCGATGGCCGGATCAGCTTATGCGGTGTGAAATACXGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTT 

CCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGA 

TAACGCAGGAAAGAACATGTGAGCARAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTICCATAGGCTCCGCCCCCCT 

GACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGC 

GCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCT 

CAGTTCGGTCTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGMCCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAG 

TCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGA 

AGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTG 

ATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 

TTTTCTACTGAACGGTGATCCCCACCGGAATTGCG 
SEQ ID 55 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATAATATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTAATAGTAATCAAITACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCMCGACCCCCGCCCATTGACGTCMTAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGICA 
ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGC(MGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGGCAGTACACCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 
AMTCMCGGGACTTTCCAAAATGTCGTAATAACCCCGCCCCGTTGACGCAMTGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT 
CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 
GGTGCATTGGAACGCGGATTCCCCGTGCCMGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 
GTTTTTGGCTTGGGGCCTATACACCCCCGCTTCCTTATGCTATAGGTGATGGTATAGCTTAGCCTATAGGTGTGGGTTATTGACCATTATTGACCAC 
TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTATCTCTATTGGCTATATGCCAATACTCTGTCCTTCAG 
AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTCACATATACAACAACGCCGTCCCCCGTGCCCGCAGTTTT 
TATTAAACATAGCGTGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 
GGTCCCATGCCTCCAGCGGCTCATGGTCGCTCGGCAGCTCCTTGCTCCTAACAGTGGAGGCCAGACTTAGGCACAGCACAATGCCCACCACCACCAG 

tgtckegcacaaggccgtggcggtagggtatgtgtctg^ 

GCAGAAGAAGATGCAGGCAGCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTAACGGTGGAGGGCAGTGTAGTCT 

GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCRGTCACCGTCg 

tcgacgccaccatgtgggccaccatcgtgggcaagcgcgccaagggccccgccagcggccccaccaccaactggggcatccccaacagcgccatctg 

cagcagcggcttcagcggcaccaccacccccaccgtgcccagcgtgagcggcaacaagcccgtgaccaccatccagcagctgagccccgccaccagc 

ggcagcgccgccgtggacctgtgcaccatccaggccgtgagcctgctgcccggcgagcccccccagaagacccccaccggcgtgtacggccccctgc 

ccaagggcaccgtgggcctgatcctgggccgcagcagcctgaacctgaagggcgtgcagatccacaccagcgtggtggacagcgactacaagggcga 

gatccagctggtgatcagcagcagcatcccctggagcgccagcccccgcgaccgcatcgcccagctgctgctgctgccctacatcaagggcggcaac 

agcgagatcaagcgcatcggcggcctgggcagcaccgaccccaccggcaaggccgcctactgggccagccaggtgagcgagaaccgccccgtgtgca 

aggccatcatccagggcaagcagttcgagggcctggtggacaccggcgccgacgtgagcatcatcgccctgaaccagtggcccaagaactggcccaa 

gcagaaggccgtgaccggcctggtgggcatcggcaccgccagcgaggtgtaccagagcaccgagatcctgcactgcctgggccccgacaaccaggag 

agcaccgtgcagcccatgatcaccagcatccccctgaacctgtggggccgcgacctgctgcagcagtggggcgccgagatcaccatgcccgccccca 

gctacagccccaccagccagaagatcatgaccaagatgggctacatccccggcaagggcctgggcaagaacgaggacggcatcaagatccccgtgga 

ggccaagatcaaccaggagcgcgagggcatcggcaacccctgcgcttaaagaattcAGACTCGA(XAAGTCTA(3AAAGCCATGGATATCGGATCCAC 

TACGCGTTAGAGCTCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAAGG 

TGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGAC 

AGCAAGGGGGAGGATTGGGAAGACAATAGCAGGGGGGTGGGCGAAGAACTCCAGCATGAGATCCCCGCGCTGGAGGATCATCCAGCCGGCGTCCCGG 

AAAACGATTCCGAAGCCCAACCTTTCATAGAAGGCGGCGGTGGAATCGAAATCTCGTGATGGCAGGTTGGGCGTCGCTTGGTCGGTCATTTCGAACC 

CCAGAGTCCCGCTCAGAAGAACTCGTCAAGAAGGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAAAGCACGAGGAAGCGGTCA 

GCCCATTCGCCGCCAAGCTCTTCAGCAATATCACGGGTAGCCAACGCTATGTCCTGATAGCGGTCCGCCACACCCAGCCGGCCACAGTCGATGAATC 

WGAAAAGCGGCCATTTTCCACCJlTGATATTCGGCMGCAGGCATCGCCATGGGTCACGACGAGATCCTCGCCGTCGGGCATGCGCGCC'rrGAGCCT 

GGCGAACAGTTCGGCTGGCGCGAGCCCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGTGCTCGCTCGATG 

CGATGTTTCGCTTGGTGGTCGAATGGGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGATGGATACTTTCTCGGCAGGAG 

CAAGGTGAGATGACAGGAGATCCTGCCCCGGCACTTCGCCCAATAGCAGCCAGTCCCTTCCCGCTTCAGTGACAACGTCGAGCACAGCTGCGCAAGG 
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MCGCCCGTCGTGGCCAGCCACGATAGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCACCGGAC^^ 

CCCTGCGCTGACAGCCGGAACACGGCGGCATCAGAGCAGCCGATTGTCTGTTGTGCCCAGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAG 
AACCTGCGTGCAATCCATCTTGTTCAATCATGCGAAACGATCCTCATCCTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGG 
CAAGAAAGCCATCCAGTT'PACTTTGCAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTGGCAATTCCGGTTCGCTTGCTGTCCATAAAACCGCC 

CAGTCTAGCTATCGCCATGTMGCCCACTGCMGCTACCTGCT^ 

TCCGGGGTCAGCACCGTTTCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTGAAGCTAA 
TTCATGGTTAAATTTTTGTTAAATCAOKJTCA^ 

TTGAGTGTTGTTCC^GTTTGGMCAAGAGTCCACTATTAMGMOSTGGACTCCMCGTCAAAG^ 

TCAGCTTATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCG 

GTCGTTCQGCTGCGGCGAGCGGTATCAOTCACTCJ^ 

AAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGT^^ 

MGTCAGWMTGGCGAAM 

ACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTOTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCA 
AGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATC 
GCCACTGGCAGCA(?CCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACT 
AGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTA 
GCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAMGGATCTCMGMGATCCTTTGATCTTTTCTACTGAACGGTGATCCCCACC 

GGAATTGCG 
SEQ ID 5(5 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATMTATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCMTAATGACGTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCA 
ATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGGCAGTACACCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 

AAATCAACGGGACTTTCCAAAATGTCGTAATMCCCCGCCCCGTTCtfiC 

CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 
GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 
QYTTTTGGCTTGGGGCCTATACACCCCCGCTTCCTTATGCTATAGGTGATGGTAXAGCTTAGCCTATAGGTGTGGGT^ 

TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTATCTCTATTGGCTATATGCCAATACTCTGTCCTTCAG 
AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTGACATATACAACAACGCCGTCCCCCGTGCCCGCAGTTTT 

TATTAAACATArcGTGGGATCTCtt^ 

GGTCCCATGCCTCCAGCGGCTCATGGTCGCTCGGCAGCTCCTTGCTCCTAACAGTGGAGGCCAGACTTAGGCACAGCACAATGCCCACCACQACCAG 
TGTGCCGCACAAGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGAGCTCGGAGATTGGGCTCGCACCGCTGACGCAGATGGAAGACTTAAGGCAGCG 

gcmmgaagmgc^^ 

GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 

tcgacgccaccatgaacaagagccgcaagcgccgcaaccgcgagagcctgctgggcgccgccaccgtggagccccccaagcccatccccctgacctg 

gaagaccgagaagcccgtgtgggtgaaccagtggcccctgcccaagcagaagctggaggccctgcacctgctggccaacgagcagctggagaagggc 

cacatcgagcccagcttcagcccctggaacagccccgtgttcgtgatccagaagaagagcggcaagtggcgcatgctgaccgacctgcgcgccgtga 

acgccgtgatccagcccatgggccccctgcagcccggcctgcccagccccgccatgatccccaaggactggcccctgatcatcatcgacctgaagga 

ctgcttcttcaccatccccctggccgagcaggactgcgagaagttcgccttcaccatccccgccatcaacaacaaggagcccgccacccgcttccag 

tggaaggtgctgccccagggcatgctgaacagccccaccatctgccagaccttcgtgggccgcgccctgcagcccgtgcgcgagaagttcagcgact 

gctacatcatccactgcatcgacgacatcctgtgcgccgccgagaccaaggacaagctgatcgactgctacaccttcctgcaggccgaggtggccaa 

cgccggcctggccatcgccagcgacaagatccagaccagcacccccttccactacctgggcatgcagatcgagaaccgcaagatcaagccccagaag 

atcgagatccgcaaggacaccctgaagaccctgaacgacttccagaagctgctgggcgacatcaactggatccgccccaccctgggcatccccacct 

acgccatgagcaacctgttcagcatcctgcgcggcgacagcgacctgaacagcaagcgcatgctgacccccgaggccaccaaggagatcaagctggt 

ggaggagaagatccagagcgcccagatcaaccgcatcgaccccctggcccccctgcagctgctgatcttcgccaccgcccacagccccaccggcatc 

atcatccagaacaccgacctggtggagtggagcttcctgccccacagcaccgtgaagaccttcaccctgtacctggaccagatcgccaccctgatcg 

gccagacccgcctgcgcatcatcaagctgtgcggcaacgaccccgacaagatcgtggtgcccctgaccaaggagcaggtgcgccaggccttcatcaa 

cagcggcgcctggaagatcggcctggccaacttcgtgggcatcatcgacaaccactaccccaagaccaagatcttccagttcctgaagctgaccacc 

tggatcctgcccaagatcacccgccgcgagcccctggagaacgccctgaccgtgttcaccgacggcagcagcaacggcaaggccgcctacaccggcc 

ccaaggagcgcgtgatcaagaccccctaccagagcgcccagcgcgccgagctggtggccgtgatcaccgtgctgcaggacttcgaccagcccatcaa 

catcatcagcgaqagcgcctacgtggtgcaggccacccgcgacgtggagaccgccctgatcaagtacagcatggacgaccagctgaaccagctgttc 

aacctgctgcagcagaccgtgcgcaagcgcaacttccccttctacatcacccacatccgcgcccacaccaacctgcccggccccctgaccaaggcca 

acgagcaggccgacctgctggtgagcagcgccctgatcaaggcccaggagctgcacgccctgacccacgtgaacgccgccggcctgaagaacaagtt 

cgacgtgacctggaagcaggccaaggacatcgtgcagcactgcacccagtgccaggtgctgcacctgcccacccaggaggccggcgtgaacccccgc 

ggcctgtgccccaacgccctgtggcagatggacgtgacccacgtgcccagcttcggccgcctgagctacgtgcacgtgaccgtggacacctacagcc 

acttcatctgggccacctgccagaccggcgagagcaccagccacgtgaagaagcacctgctgagctgcttcgccgtgatgggcgtgcccgagaagat 
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caagaccgacaacggccccggctactgcagcaagqccttccagaagttcctgagccagtggaagatcagccacaccaccggcatcccctacaacagc 
cagggccaggccatcgtggagcgcaccaaccgcaccctgaagaccpagctggtgaagcagaaggagggcggcgacagcaaggagtgcaccacccccc 
agatgcagctgaacctggccctgtacaccctgaacttcctgaacatctaccgcaaccagaccaccaccagcgccgagcagcacctgaccggcaagaa 
gaacagcccccacgagggcaagctgatctggtggaaggacaacaagaacaagacctgggagatcggcaaggtgatcacctggggccgcggcttcgcc 
tgcgtgagccccggcgagaaccagctgcccgtgtggatccccacccgccacctgaagttctacaacgagcccatccgcgacgccaagaagagcacca 
gcgccgagaccgagaccagccagagcagcaccgtggacagccaggacgagcagaacggcgacgtgcgccgcaccgacgaggtggccatccaccagga 
gggccgcgccgccaacctgggcaccaccaaggaggccgacgccgtgagctacaagatcagccgcgagcacaagggcgacaccaacccccgcgagtac 
qccqcctgcagcctggacgactgcatcaacggcggcaagagcccctacgcctgccgcagcagctgcagcttaaagaattcRGACTCGAGCMGTCTA 
GARAGCCATGGATATCGGATCCACTACGCGTTAGAGCTCGCTGATCAGCCTCGACTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCC 
CGTG(XTTCCTTGACCCTGGAAGGTGCCACTCCCACTGTCCTTTCCTAATAAMTGAGGARATTGCATCGCATTGTCTGAGTAGGTGTCATTCTATT 
CTGGGGGGTGGGGTGGGGCAGGACAGCMGGGGGAGGATTGGGMGACAATAGCRGGGGGGTGGGCGRAGAACTCCAGCATGAGATCCCCGCGCTGG 

TCGCTTGGTCGGTCATTTCGAACCCCAGAGTCCCGCTCAGAAGAACTCGTCAAGAAGGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATA 
CCGTAAAGCACGAGGAAGCGGTCAGCCCATTCGCCGCCAAGCTCTICAGCAATATCACGGGTAGCCAACGCTATGTCCTGATAGCGGTCCGCCACAC 
CCAGCCGGCCACAGTCGATGAATCCAGAAAAGCGGCCATTTTCCACCRTGAXATTCGGCAAGCAGGCATCGCCATGGGTCACGACGAGATCCTCGCC 
GTCGGGCATGCGCGCCTTGAGCCTGGCGAACAGT'TCGGCTGGCGCGAGCCCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCC 

atoh»ctacgtgctcgctc<»t«^^ 

tgatggatactttctcggcaggagcaaggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgac 
aacgtcgagcacagctgcgcaaggaacgcccgicgtggccagccacgatagccgcgctgcctcgtcctgcagttcattcagggcaccggacaggtcg 

GTCTTGACAAAAAGMCCGG«:GCCCCTGCGCTGACAGCCG(aaCACGGCG(^ 
GCCTCTCCACCCAAKGGCCGGAGAArcTGCGTGCMTCC^TCTTCT^^ 

CCTGCGCCATCAGfiTCCTTGGCGGCRAGAAAGCCATCCAGTTTACTTTGCAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTGGCAATTCCGGT 
TCGCTTGCTGTCCATAARACCGCCCAGTCTAGCTATCGCCATGTAAGCCCACTGCAAGCTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGTCC 
AGATAGCCCAGTAGCTGACATTCATCCGGGGTCAGCACCGTTTCTGCGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGA 
GTGCTTGCGGCAGCGTGMGCTMTTCATGGTTAMTTTTTGTTAMTCAGCTCATTTT^ 

TCAAAAGMTAGCCCGAGATAGGGTT^GAGTGTTGTTCCAGrTTGGAACMGAGTCCftCTATTAAAGMCGTGGACTCCMCGTCAMGGGCGAAMA 
CCGTCTATCAGGGCGATGGCCGGATCAGCTTATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCC 
TCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATA 
ACGCAGGAAAGAACATGTGAGCAAAAGGCGAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGA 
CGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGC 
TCTCCTGTXCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCXTTCTCATAGCTCACGCTGTAGGTATCTCA 
GTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTC 
CAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCIACAGAGTTCTTGAAG 
TGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGAT 
CCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTT 

TTCTACTGAACGGTGATCCCCACCGGAATTGCG 

ATGAACCCAKAGAGATGCAAAGAAAAGCACCTCCGCGGAGACGGAGACATCGCM 

CAGMGMCAGATGAAGTTGCCATCC^CCMGMGGCAGAGCCGCCAACTTGGGCACAACTAAAGAAGCTGACGCAGTTAGCTACAAAATA^^ 
GAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCAATGGTGTCTGCAGGTGTACCCAACAGCTCCGAAGAG 

ACAGCGACCATCGAGAACGGGCCATGA 

GC^GGAKAGATGAAGCTGCCCAGCACCAAGAAGGCCGAGCCCCCCACCTGGGCCCAGCT^ 

GAACACCAAGGTGACCCAGACCCCCGAGAGCATGCTGCTGGCCGCCCTGATGATCGTGAGCATGGTGAGCGCCGGCGTGCCCAACAGCAGCGAGGAG 
ACCGCCACCATCGAGAACGGCCCCGCTTAA 

? T GAACCCATC^^ 

CAGAAGMCAGATGAAGTTGCCATCCACCAAGAAGG(^GAGCCGCCAACTTGGGCACAACTAAAGAAGCTGACGCAGTTAGCTACAAAATATCTAGA 
GAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCrrGATGATTGTATCAATGGTGGTGTACCCAACAGCTCCGAAGAGACAGCGA 
CCATCGAGAACGGGCCATGATGACGATGGCGGTTTTGTCGAAAAGAAAAGGGGGAMTGTGGGGAAAAGCMGAGAGATCAGATTGTTACTCT 

GTGTAGAAAGAAGTAGACATAGGAGACTCCATTTTGTTCTGTACTAA 

SEQ ID 60 

ATGAACCCCAGCGAGATGCAGCGCAAGGCCCCCCCCCGCCGCCGCCGCCACCGCAACCGCGCCCCCCTGACCCACAAGATGAACAAGATGGTGACCA 
GCGAGGAGCAGATGAAGCTGCCCAGCACCAAGAAGGCCGAGCCCCCCACCTGGGCCCAGCTGAAGAAGCTGACCCAGCTGGCCACCAAGTACCTGGA 
GAACACCAAGGTGACCCAGACCCCCGAGAGCATGCTGCTGGCCGCCCTGATGATCGTGAGCATGGTGGTGTACCCCACCGCCCCCAAGCGCCAGCGC 
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GCGTGGAGC6CAGCCGCCACCGCCGCCTGCACTTCGTGCTGTRCGCTTM 
CT^<&CCCCACMCW^ 

AATAAGCCAGTTACCACAATACAACAATTGTCCCCCGCCACAAGCGGCAGTGCAGCAGTAG 

gmgaggacagcgtgagcgtgagcgacgcccccggcagctgcatcatcgactgcmcgagaacacccgcaaga^^ 
?gcactSgS^ 

SSSScScCAOTGC^ 

S?S?ScSStgcaggacgtggc 

catcagccagctgccccagtacaacaactgcccccccccccaggccgccgtgcagcaggcttaa 

*g?gg£a?^^ 

???cSaacaacccc^ 

SSSaSactatacmgcS 

SlGGACmS 
TTMTAGCTCTTCAMTCCWGGAGTGCCM 

AAGAATAGGAGGGCTTGGAAGCACTGATCCAACAGGAAAGGCTGCATATTGGGCAAGTCAGGTCTCAGAGMCA 
CMGGAAAACAGTTTGAAGGGTTGGTAGACACTGGAGCAGATGTCTCTATX^TTGCT^ 
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ATCAAGAAAGAGAAGGAATAGGGAATCCTTGCTAG 
ATCTCGGCCACCATCGTGGG^^ 

TOIGCGGCACCACCACCCCCACCGTGCCCAGCGTGAGCGGCAACAAGCCCGTGACCACCATCCAGCAGCT 

CGTGGACCTGTGCACCATCCAGGCCGTGAGCCTGCTGCCCGGCGAGCCCCCCCAGAAGACCCCCACCGGCGTGTACGGCCCCCTGCCCMGGGCACC 

CTGGGCCTGATCCTGGGCCGCAGCAGCCTGAACCTGAAGGGCGTGCAGATCCACACCAGCGTGGT 

TGATCAGCAGCAGCATCCCCTGGAGCGCCAGCCCCCGCGACCGCATCGCCCAGCTGCTGCTGCTGCC 

GCGCATCGGCGGCCTGGGCAGCACCGACCCCACCGGCAAGGCCGCCTACTGGGCC^ 
' CAGGGCAAGCAGTrCGAGGGCCTGGTGGACRCCGGCGCCGACGTGAGCATCATCGCCCTGMCCAGTGGCCCMGMCTGGCCCMGCAGMGGCCG 

TGACCGGCCTGGTGGGCATCGGCACCGCCAGCGAGGTGTACCAGAGCACCGAGATCCTGCACTGCCTGGGCCCCGACAACCAGGAGAGCACCGTGC^ 
(^CATGATCACCAGCATCCCCCTGAACCTGTGGGGCCGCGACCTG 

ACCAGCCAGAAGATCATGACCAAGATGGGCTACATCCCCGGCMGGGCCTGGGCMGAACGAGGACGGCATCMGATCCCCGTGGAGGCCAAGATCA 

accaggagcgcgagggcatcggcaacccctgcgcttaa 
atgaataaI™ 

AACCAGTGTGGGTAMTCAGTGGCCGCTACCAAAACAAAAACTGGAGGCTTTACATTTAT^^ 
WCGTTCTCACCTTG(^TTCTCCTGTGTTTGTMTT(^GAAGAAATCAGGCAAATGGCGTATGTTM 

JIacccatggg^^ 

CCATCCCTCTGGCAGAGCAGGATTGCGAAAAATTTGCCTTTACTATACCAGCCA 
ACCTCAGGGAMGCTT^ 

CATTGTATTGATGATATTTTATGTGCTGCAGAAACGAAAGATAMTTAATTGACTGTTATACRTTTCTGCAAGCAGAGGTTGCCAATGCTGGACTGG 
CRATAGCATCTGATAAGATCCAAACCTCTACTCCTTTTCATTATTTAGGGATGCAGATAGAAAATAGAAAAATXAAGCCACAAAAAATAGAAATAAG 

^Scacattaaaaacactaaatgattttc^ 

AAWTGTTCTCTATCTTMGAGGAGACTCAGACTTAMTAGTAAMGAATGTTAACCCCAGAGGCAACAAAAGAAATTAAATTAGTGGAAGAAAAAA 

Scagtcagcgcaaataaatagaata^tcccttagccccactccmcttttgatt^ 

TACTGATCTTGTGGAGTGGTCATTCCTTCCTCACAGTACAGTTAAGACTTTTAC^ 

GGAAGATTGGTCTTGCTMTTTTGTGGGAATTATTGATMTCATTACCCAAAAACAMGATCTTCCAGTTCTTAAAATTGACTACTTGGATTCTACC 
TAAAATTACCAGACGTGAACCTTTAGAAAATGCTCTAACAGTATTTACTGATGGTTCCAGCAATGGAAAAGCACCTTACACAGGACCGAAAGAACGA 

GTAATCAAAACTCCATATCAATCGGCTCAAAGAGCAGAGTTGGTTGC&G 

aS^tgtagtacaggctacmggga^ 

ACA^CTGTMGAAAAAGAMTTTCCCATTTTATATTACA 

gaotactgctaStgcactcataaaagcacaagaacttcatgcttt 

SraGGCAAAAGATATTGTACAACATTGCACCCAGTGTCMCT^ 

GCAACTTGCCAMCAGGAGAAAGTACTTCCCATGTTAAAAAACATTTATTGTCTTGTTTTGCTGTAATGGGAGTTCCAGAAAAAATCAAAACTGACA 
ATGGACCAGGATATTGTAGTAAAGCTTTCCAARAATTCTTAAGTCAGTGGAAMTTTCACATACAACAGGMTTCCTTATAATTCCCAAGGACAGGC 

CATAGTTGAAAGMCTMTAGAACACTCAAAACTCAATTAGTTAAACAAA 

AATCTAGCACTCTATACTTTAAATTTTTTAAACATTTATAGAAATCAGACTACTACTTCTGCAGAACAACATCTTACTGGTAAAAAGAACAGCCCAC 
aS(*AAAACT^ 

AGGAGAAAATCAGCTTCCTGTTTGGATACCCACTAGACATTTGAAGTTCTACAATGAACCCATCAGAGATGCAAAGAAMGCACCTC 
GAGACATCGCAATCGAGCACCGTTGACTCACMGATGAACAAMTGGTGACGTCAGAAGAACAGATGAAGTTGCCATCCACCAAGMGGC 

cc^cttgggcacaactaaagm«:tgacgcagtogctacaaaatatctagagaacacaaaggtgacac^ 
ccttgatgattgtatcaatggtggtaagtctccctatgcctgcaggagcagctgcagctaa 

?tgaacaagakcgcmgcgccgcmccgcga<3agcctgctgggcgccgccaccg^ 
agcccgtctgggtLccagtggcccctgcccaagca^^ 

CAGOTCAGCCCCTGGMCAGCCCCGTGTTCGTGATCCAGAAGMGAGCGGmGTGK 

Satccccctggcc^gcaggactgcgag^^ 

GCCCCAGGGCATGCTGMCAGCCCCACCATCTGCCAGACCTTCGTGGGCCGCGCCCTGCAGCCCGTGCGCGAGAAOT 
CACTGCATCGACGACATCCTGTGCGCCGCCGAGACCAAGGACAAGCTGATCGACTGCTA(^CCTTCCTGCAGGCCGA 

CCATCGCCAGCGACAAGATCCAGACCAGCACCCCCTTCCACTACCTGGGCATGCAGATCGAGAACCGCAAGATCAAGCCCCAGAAGATCGAGATCCG 

flMCTGTTCAGCATCCTGCGCGGCGACAGCGACCTGAACAGCMGCGCATGCTGACCCCCGAGGCCACCAAGGAGATCAAGCTGGTG 
TCCAGAGCGCCOVGATCMCCGCATCGACCCCCTGGCCCCCCTGCAGCTGCTGATCTTC(3CCACCGCCCACAGCCCCACCGGCATCATC 
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CACCGRCCTGGTQGAGTGGAGCTTCCTGCCCCACRGCACCGTGAAfiACCTTCMCCTGTACCTC 
SgCgIScAA^ 

S^DOCQCCGCGMCCa 

SgSSSctacc^ 

MMCGOCTRCGMGTKaBGCCAa 
ScCT^CCCTGTAcScTGMCTlCCTGM^ 

SgS^cagc^ 
SSSagSgcacc^ 

cctggacgactgcatcaacggcggcaagagcccctacgcctgccgcagcagctgcagcgcttaa 

tatiengpa 
SEQ ID 68 



psrtghdddggfvekkrgkcgekqersdcycvcversrhrrlhfvlya 
peSSotpIgswise 

PRCKKGKHWASQCRSKFDKNGQPLSGNEQRGQPQAPQQTGAFPIQPFVPQGFQGQQPPLSQVFQGISQLPQYNNCPPPQMVQQ 

PRCKKGKHWASQCRSKFDKNGQPLSGHEQRGQPQAPQQTGAFPIQPFVPQGFQGQQPPIiSQVFQGISQliPQYNNCPPPQAAVQQA 

TSQKIMTKMGYIPGKGLGKNEDGIKIPVEAKINQEREGIGNPC 
WOTVGKRMGP^^ 

QGKQFEGLVDTGADVSIIALNQWPKNWPKQKAVTGLVGIGTASEVYQSTEILHCLGPDNQESTVQPMITSIPLNLWGRDLLQQWGAEITMPAPSYSP 
TSQKIMTKMGY IPGKGLGKNEDGI KI PVEAKINQEREGIGNPCA 

QPMGPLQPGLPSPAMIPKDWPLIIIDLKDCFFTIPLAEQDCEKFAmPAIW^^ 

NLFSILRGDSDLNSKRMIiTPEATKEIKLVEEKIQSAQIlJRIDPlAPLQLLIFATAHSPTGIIIQNTDLVEWSFIiPHSTVKTFTLYLDQIATLIGQTR 



.OS2O03 
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t R t r^rNDPDKIWPLTKEQVRQ^^^ 

======= 

ET^SSTVDSQDEQNGDVRRTDEVAIHQEGRAANLGTTKEADAVSYKISREHKGDINPREYRACSLDDCINGGKSPYACRSSCS 
NLAIYTLNFLNIYRNQ 



KTCGAGGTGGGACATGCTGGCGGCMTACTGCTCTTTAAGK 



GTGCCCAMZGTGGATGCTTTTCTCTAGGGTGAAGGGACTCTCGAGTC^ 



CTTTCTCTGGGGACCCAGAGAGAAGGAGGCGTTTTCACCACAGCCGAACAGGGCAGGACCCCAGCACCCGGGACCCAGCGGGACT^ 

w< 

IT 
FT 

CCTGAGATACGGCCTCGTGGGAAGGGAAAGACCTGACO TCCctcMC^GA^ 



■^Arr.Tfi^TGGGCCACGCGGCTGTTM 



rAACAATGTGTTTACAGGCAGTATGCTTGGTAAAAGTCATCGCCATTCTCCATT 



^^^^^^ 
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TCTTGAGGCATGGCGGTTCCTGGTMTWTACMCTGGTACAGGCCEGGGMGRGACTCflflGTAGGftGCGCCTGCCCGAGCTGAGACTAGATGTGAA 
CCTTTCACCATGAAAATGTraAAAGAIATAMGGAAGGAGTTAAACAATATGGAICCAACTC^C^ 

ATGGAAATAGACTTACTCCTTATGACTGGGAAATTTTGGCCAAATCTTCX3CTTTCATCCTCTCAGTATCTACAGTTTAAMCCTGGTGGATTGA 

AGTACAAGAACAGGTACGAAAAAATCAGGCTACTAAGCCCACTGTTMTATAGACGCAGACCAATTGTTAGGMCAGGTCCAAATTGGAGCACCATT 

AACCMCMTCAGTGATGCAGAATGAGGCTATTGMCAAGTAAGGGCTATTTGCCTCAGGGCCTGGGGAAAMTTCAGGACCCAGGAACAGCTTTCC 

CTATTMTTCAATTAGAC^GGCTCTAAAGAGCCATATCCTGACTTTGTG^ 

CCGA?^GTTATTGTAGAATTAATGGCCTATGAAMTGCAMrCCAGAATGTCAGTCGGCCATAMGCCATT7VAMGGAAAAGTTCCAGCAGGAGTT 
GATGTAATTACAGAATATGTGAAGGCTTGTGATGGGATTGGAGGAGCTATGCATAAGGCAATGCTAATGGCTCAAGCAATGAGGGGGCTCACTCTAG 
GAGGACAAGTTAGAACATTTGGGAAAAAATGTTATAATTGTGGTCAAATCGGTCATCTGAAAAGGAGTTGCCCAGGCTTAAATAAACAGAATATAAT 
AAATCAAGCTATTACAGCAAAAAATAAAAAGCCATCTGGCCTGTGTCCAAAATGTGGAAAAGCAAAACATTGGGCCAATCAATGTCATTCTAAATTT 
GATARAGATGGGCAACCATTGTCTGGAAACAGGAAGAGGGGCCAGCCTCAGGCCCCCCRACAAACTGGGGCATTCCCAGTTAAACTGTTTGTTCCTC 
AGGGTTITCAAGGACAACAACCCCTACAGAAAATACCACCACTTCAGGGAGTCAGCCAATTACAACAATCCAACAGCTGTCCCGCGCCACAGCAGGC 
AGCACCGCAGTAGATTTATGTTCCACCCAAATGGTCTTTTTACTCCCTGGAAAGCCCCCACAAAAGATTCCTAGAGGGGTATATGGCCCGCTGCCAG 
AAGGGAGGGTAG<XXTTTGAGGGAGATCAAGTCTAAATTTGAAGGGAGTCCAAATTCAT^ 

GTTAGTGATCAGCTCCACTGTTCCCCGGAGTGCCMTCCAGGTGATAGMTTGCTCMTTACTGCTTTTGCCTTATGTTAAAATTGGGGAAAACAAA 
AAGGAAAGAACAGGAGGGTTTGGAAGTACCAACCCTGCAGGAAAAGCTGCTTATTGGGCTAATCAGGTCTCAGAGGATAGACCCGTGTGTACAGTCA 
CTATTCAGGGAAAGAGTTTGMGGATTAGTGGATACCCAGGCTGATGTTTCTGTC^TCGGCATAGGTACTGCCTCAGAAGTGTATCAAAGTGCCATG 
ATTTTACATTGTCCAGGATCTGATAATCAAGAAAGTACGGTTCAGCCTGTGATCACTTCATTCCMTCAATTTATGGGGCCGAGACTTGTTACAACA 
ATGGCATGCAGAGATTACTATCCCAGCCTCCCTATACAGCCCCAGGMTAAAAAAATCATGACTAAAATGGGATAGCTCCCTAAAAAGGGACTAGGA- 

MGMGTCCCAATTGAGGCTGAAAAAAATCAAAAAAGAA^^ 

TTAACTTGGGGGAAAAAAAMCAACTGTATGGTAMTCAGCAGCGCTTCCAAAACAAAAACTGGAGGCTTTACATTTATTAGCAAAGAAACAATTAG 

aaaaaggacattgagccttcattttcgccttggaattctgtttgtaattcagaaaaaatccggcagatggcgtataatgccgtmttcaacccatgg 

GGCXTCTCCCACCCCGGTTKCCTCTC^^ 
ATTTTGAAAAATTTGCTTmCCACACCAGCCT 

TCAACTATTTGTCAGCTCAAGCTCTGCAACCAGTTAGAGACAAGTTTTCAGACTGTTACATCGTTCACTATGTTGATATXTTGTGTGCTGCAGAAAC 
GAGAGACAAATTAATTGACCGTTACACATTTCTGCAGACAGAGGTTGCCAACGCGGGACTGACAATAACATCTGATAAGATTMAACCTCTACTCCT 

TTCCGmCTTGGGMTKAGGTAGAGGAAAGGAAAATC^ 

AGTTGCTAGGAGATACTMTTGGATTTGGAGATATTAATTGGATTTGGCCMCTCTAGGCATTCCTACTTATGCCATGTCAAATTTGTTCTCTTTCT 
TAAGAGGGGACTCGGAATTAAATAGTGAAAGAACGTTAACTCCAGAGGCMCTA^ 

AAATAGAATAGATCACTTGGCCCCACTCCAAATTTTGATTTTTGCTACTGCAGATTCCCTAACAGGCAIWTTGTTCAAAATACAGATCTTGTGGAG 
TGGTCCTTCCTTCCTCACAGTACMTTAAGACTTTTACATTGTACTTC^ 

TGTGTGGAAATGACCCAGATAAAATCACTGTTCCTTTCAACAAGCAACAGGTTAGACAAGCCTTTATCAATTCTGGTGCATGGCAGATTGGTCTTGC 
CGATTTTGTGGGMTTATTGACMTCGTTACCCCAAAACAAAAATCTTCCAGWTTrAAAATTGACTACTTGGATTTTACCTAAAGTTACCAAACAT 

AAGCCTTTAAAAAATGCTCTGGCAGTGTTTACTGATGGm^ 

ATCACTTGACTCAAAGAGCAGAGTTGGTTGCCGTCATTACAGTGTTAACAAGATTTTAATCAGTCTATTAACATTGTATCAGATTCTGCATATGTAG 
TACAGGCTACAAAGGATATTGAGAGAGCCCTAATCAAATACATTATGGATGATCAGTTAAACCCGCTGXTTAATTTGTTACAACAAAATGTAAGAAA 
AAGAAATTTCCCATTTTATATTACTCATATTCGAGCACACACTAATTTACCAGGGCCTTTAACTAAAGCAAATGAACAAGCTGACTTGCTAGTATCA 
TCTGCATTCaTGGMGCACAAGAACTTCATGCCTTGACTCATGTAAATGCMTAGGATTAAAAM 

ATATTGTACAACATTGCACCCAGTGTCAGATTCTACACCTGGCCACTCAGGAGGCAAGAGTTAATCCCAGAGGTCTATGTCCTAATGTGTTATGGCA 
MTGGATGTCATGCACGTACCTTCATTTGGAAMTTGTCATTTGTCCATGTGACAGTTGATACTTATT(^CATTTCATATGGGCAACCTGCCAGACA 
GGAGAAAGTACTTCCCATGTTAAAAGACATTTATTATCTTGTTTTCCTGTCATGGGAGTTCCAGAAAAAGTTAAAACAGACAATGGGCCAGGTTACT 
GTAGTAAAGCAGTTCAAAAATTCTTAAATCAGTGGAAAATTACACATACAATAGGAATTCTCTATAATTCCCAAGGACAGGCCATAATTGAAAGAAC 
TAATAGAACACTCAMGCTCMTTGGTXAMCAAAAAAMGGAAMGACAGGAGTATAACACTCCCCAGATGCAACTTAATCTAGCACTCTATACTT 
TAAATGTTTTAAACATTTATAGAAATWGACCACTACCTCTGCAGAACMCATCTTACTGGXAAAAGGMCAGCCCACATGAAGGAAAACTGATTTG 
GTGGAAAGATMTAAAAATAAAACATGGGAMTGG03Ma;TGATAACGTGGGG(3AGAGGT 

GTTTGGATACCCACTAGACATTTAAAGTTCTACAATGAACTCACTGGAGATGCAAAGAAAAGTGTGGAGATGGAGACACCCCAATCGACTCGCCAGG 
TAAAC^AAATGGTGATATCAGAAGMCAGAAAAAGTTGCCTTCCATCAAGGAAGCAGAGTTGCCAATATAGGCACAATTAAAGAAGCTGACACAGTT 

AKTAAAAAAAAAAGCCTAGAt^TACAAAGGTGACACC^ 

CCCMGTCTGCAGGAGCAGCTGCAGCTAATTATACTTACTGGGCCTATGTGCCTTTCCCACCCTTAATTCGGGCAGTTACATAGATGGATAATCCTA 
TTGAAGTAGATGTTAATAATAGTGCATGGGTGCCTGGCCCCACAGATGACTGTTGCCCTGCCCAACCTGAAGAAGGAATGATGATGAATATTTCCAT 

TGGGTATCCTTATCCTCCTGTTTGCCTAGG(^GGCA^ 

ACCAGTAGATTTACTTATCACATGGTAAGTGGAATGTCACAGATAAATAATTTACAGGACCCTTCTTATCAAAGATCATTACAATGTAGGCCTAAGG 
GGMGGCTTGCCCCAAGGAAATTCCCAAAGAATCAAAAAGCCCAGAAGTCTTAGTCTGCGGAGAATGTGTGGCTGATACTGCAGTGTAGTACAAAAC 
MTGAATTTTGAACTATGATAGACTGGGTCCCTTGAGGCCMTTATATCATAACTGTACAGGCCAGACTCATTCATGTTCACAGGCCCCATCCATCT 
GGCCCATTAATCCAGCCTATGACGGTGATGTMCTGAAAGGCTGGACCAGGTTTATAGMGGTTAGAATCACTCTGTCCMGGAMTGGGGTGAAAA 
GGGAATTTCATCACCTTGACCAAAGTTAGTCCTGTTACTGGTCCTGAACATCCAGAATTAGGAAGCTTACTGTGGCCTCACACCACATTAGAATTTG 
TTCTGGAAATCAAGCTATAGGAACAAGAGATCGTAAG1CATATTATACTATCAACCTAAATTCCAGTCTGACAATTCCTTTGCAAAATTGTGTAAAA 
CTCCCTTATATTGCTAGTTGTAGGAAAAACATAGTTATTAAACCTGATTCCCAAACCATAATCTGTGAAAATTGTGGAATGTTTACTTGCATTGATT 
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TGACTCTTMTTGGCAGCACCGTATTCTACTA5GAAGAGCMGAGAG 

SagSact^ 

StCaIS 

TCTGCAAG(aGGAGAAGATAATCTTACTnAGACAnTCAJ^ 
GAGTGCAGTGGCGTGATCTCAGCTCA^ 

rr^CCftCCATGCCTGGCTMTTTTTTTTGGGTTTTTAATAGAGATGGAGTWCACCGTGTTAGCC^GGATGGTCTCGATCTCCTGACCTTGTGATC 
TGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGTCGTGAGCCACCGTGCCCAGCCAAGAAAAAATTTTTGAGGCATCAAMGCCCATTT 

TKTKCAGGAA(^ 

ATAGTGACGATGGTGGTTTTGTCAAAMGAAAAGGGGGGGATATGTMGGA 

ccattctcga^^ 

JJoTTGAGGG^^ 

cSSSotccataStgagcgccgg 
cacctgatgagaaatacccacaggtgtggaggggctggcccccttca 

TCAAGCGGTGTGTCCTGTGCCAAATAGCACGGCACCATACCAGCMCCCACAGCGATGGCGTCTMTTCACCAGCMCACAGGACGCGGCGCTGTAT 

SSSgSS 

======= 

SS5=SSSS==SS=SSSSS===SS^ 

tgatgtaattacagaatatgtgmggcttgtgatgggattggaggag^ 
cagcaccgcagtaa 



t&OH^SJL S^ii *, 0S2003 

23/25 
SEQ ID 77 

ATGGGCCAGACCGAGAGCAAGTACGCCAGCTACCTGAGCTTCATCAAGATCCTGCTGCGCCGCGGCGGCGTGCGCGCCAGCACCGAGAACCTGATCA 
CCCTGTTCCAGACCATCGAGCAGTTCTGCCCCTGGTTCCCCGAGCAGGGCACCCTGGACCTGAAGGACTGGGAGAAGATCGGCAAGGAGCTGAAGCA 

GGCCAACCGCGAGGGCAAGATCATCCCCCTGACCGTGTGGMCGA 

GTGAGCGTGAGCGACGCCCCCMGAGCTGCGTGACCGACTGCGAGGAGGAGGCCGGCACC^GAGCCAGCAGGGCACCGAGAGCAGCCACTGCAAGT 

ACGTGGCOGAGAGCGTGATGGCCCAGAGCACCCAGAACGTGGACTACAGCCAGCTG 

CGGCCCCGAGAGCCT(^CCCAGCGAGCtt^ 

CAGGTGCGCCAGGCCCAGACCCCCCGCGAGAACCAGGTGGAGCGCGACCGCGTGAGCATCCCCGCCATGCCCACCCAGATCCAGTACCCCCAGTACC 
AGCCCGTGGAGAACAAGACCCAGCCCCTGGTGGTGTACCAGTACCGCCTGCCCACCGAGCTGCAGTACCGCCCCCCCAGCGAGGTGCAGTACCGCCC 

CCA(^CGTGT(£CCCGTGCCCAACA^^ 

CCCCAGCCCCCCACCGT<XGC^^^^ 

ACCT<&AG{£CTGGCGCTTCCTGGTGA^^ 

GCCCTTCACCATGAAGATGCTGMGGACATCAAGGAGGGCGTGAAGCAGTACGGCAGCMCAGCCCCTACATCCGCACCCTGCTGGACAGCA 

CACGGCMCCGCCTGACCCCCTACGACTGGGAGATCCTGGCCAAGAGCAGCCTGAGCAGCAGCCAGTACCTGCAGTT(^GACCTGGTGGATCGAC^ 

GCGTGCAGGAGC^GGTGCGCMGAACCAGGCCACCMGCCCACCGTGAACATCGACGCCGACCAGCTGCTGGGCACCGGCCCCAACTGGAGCACCAT 

CAACCAGCAGAGCGTGATGCAGMCGAGGCCATCGAGCAGGTGCGCGCC^ 
CCCATCAACAGCAICCGCCAGC^^ 

CCCGCMGGTGATCGTGGAGCTGATGGCCTACGAGMCGCCAACCCCGAGTGCCAGAGCGCCATCAAGCCCCTGAAGGGCAAGGTGCCCGCCGGCGT 
GGACGTGATCACCGAGTACGTGAAGGCCTGCGACGGCATCGGCGGCGCCATGCACAAGGCCATGCTGATGGCCCAGGCCATGCGCGGCCTGACCCTG 
GGCGGCCAGGTGCGCACCTTCGGCAAGAAGTGCTACMCTGCGGCCAGATCGGCCACCTGAAGCGCAGCTGCCCCGTGCTGAACMG(^ 

TCMCCAGGCCATCACCGCCMGAACMGMGCCCAGCGGCCTGTGCCC^ 
GGACAAGC^CGGCCAGCCCCTGAGCGGCMCCGCAAC^GCGGCCAGCCCCAGGCCC 

CAGGGCTTCCAGGGCCAGCAGCCCCTGCAGMGATCCCCCCCCTGCAGGGCGTGAGCCAGCTGCAGCAGAGCMCAGCTGCCCCGCCCCCCAGCA^ 

CCGCCCCCCAGGCTTAA 

SEQ ID 78 1 

MGQTESECyASYLSFIKILIiRRGGVRASTENLITLFQTIEQFCPWFPEQGTLDLKDWEKIGKELKQANREGKIIPLTWNDWAIIKATLEPFQTGEDI 
VSVSDAPKSCTTDCEEEAGTESQQGTESSHCKYVAESVMAQSTQW 

QVRQAQTPRENQVERDRVSIPAMPTQIQYPQYQPVENKTQPLWYQYRLPTELQYRPPSEVQYRPQAVCPVPNSTAPYQQPTAMASNSPATQDAALY 

PQPPTVRLNPTASRSGQGGALHAVIDEARKQGDLEAWRFLVILQLVQAGEETQVGAPARAE.TRCEPFTMKMLKDIKEGVKQYGSNS 

HGNRLTPYDWEIIAKSSLSSSQYLQFKTWWIIX5VQEQVRKNQATKPTVNIDADQLLGTGPNWSTINQQSVMQNEAIEQVRAI 

PINSIRQGSKEPYPDFVARLQDMQKSITDDNARKV^^ 

GGQVRTFGKKC YNCGQIGHLKRSC PVLNKQN 1 1 NQAI TAKNKKPSGLC PKCGKGKHWANQCHS KFDKDGQPLSGNRKRGQPQAPQQTGAFPVQLFVP 
QGFQGQQPLQKI PPIjQGVSQLQQSNSC PAPQQAAPQ 

mgqSskyasylsfikiliarggvrastenlitlfqtieqfcpwf 

VSVSDAPKSCVTDCEEEAGTESQQGTESSHCKYVAESVmQST^ 
QVRQAQTPRENQVE^RVSIPAMPTQIQYPQYQPVENKTQPI^ 

PQPPTVRLNPTASRSGQGGALHAV I DE ARKQGDLE AWRFLVILQLVQAGEETQVGAPARAETRCEPFTMKMLKDIKEGVKQYGSNSPY I RTLLDS I A 

HGNRLTPYDWEILAKSSLSSSQYLQFKTHWIDGVQEQVRKNQATKPTVNIDADQLLGT 

PINSIRQGSKEPYPDFVARLQDMQKSITDDNARKVIVELMAYENM 

GGQVRTFGKKCYNCGQIGHLKRSCPVLNKQNIINQAITAKNKKPSGLCPKCGKGKHWANQCHSKFDKDGQPLSGNRKRGQ 

QGFQGQQPLQKIPPLQGVSQLQQSNSCPAPQQAAPQA 

SEQ ID 80 

GCCGCGGAATTTCGACTCTAGGCCATTGCATACGTTGTATCTATATCATMTATGTACATTTATATTGGCTCATGTCCAATATGACCGCCATGTTGA 
CATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATG 
GCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTMCGCCAATAGGGACTTTCCATTGACGTCA 
ATGGGTGGAGTATTTACGGTAMCTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTCCGCCCCCTATTGACGTCAATGACGGTAAATGG 
CCCGCCTGGCATTATGCCCAGTACATGACCTTACGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGT 
TTTGGCAGTACACCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCA 
AAATCAACGGGACTTTCCAAAATGTCGTAATAACCCCGCCCCGTTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTATATAAGCAGAGCT 
CGTTTAGTGAACCGTCAGATCGCCTGGAGACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGGCCGGGAAC 
GGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGACTCTATAGGCACACCCCTTTGGCTCTTATGCATGCTATACT 

GTTTTTGGCTTGGGGCCTATACACCCCCGCTTOT 

TCCCCTATTGGTGACGATACTTTCCATTACTAATCCATAACATGGCTCTTTGCCACAACTATCTCTATTGGCTATATGCCAATACTCTGTCCTTCAG 
AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCCCATTTATTATTTACAAATTCACATATACAACAACGCCGTCCCCCGTGCCCGCAGTTTT 
TATTAMCATAGCGTGGGATCTCCACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTCCACATCCGAGCCCT 
GGTCCCATGCCTCCAGCGGCTCATGGTCGCTCGGCAGCTCCTTGCTCCTAACAGTGGAGGCCAGACTTAGGCACAGCACAATGCCCACCACCACCAG 
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TGT^CGCACAAGGCCGTGGCGGTAGGGTATGTGTCTGAARATGAGCTCGGRGATTGGGCTCGCACCGCTGACGCAGATGGAAGRCTTAAGGCAGCG 

GCAGMGAAGATGCAGGCAGCTGAGTTGTTGTATTCTGATAAGAGTCAGAGGTAACTCCCGTTGCGGTGCTGTTMCGGTGGAGGGCAGTGTAGTCT 

GAGCAGTACTCGTTGCTGCCGCGCGCGCCACCAGACATAATAGCTGACAGACTAACAGACTGTTCCTTTCCATGGGTCTTTTCTGCAGTCACCGTCg 

tcgacgccaccatgggccagaccgagagcaagtacgccagctacctgagcttcatcaagatcctgctgcgccgcggcggcgtgcgcgccagcaccga 

gaacctgatcaccctgttccagaccatcgagcagttctgcccctggttccccgagcagggcaccctggacctgaaggactgggagaagatcggcaag 

gagctgaagcaggccaaccgcgagggcaagatcatccccctgaccgtgtggaacgactgggccatcatcaaggccaccctggagcccttccagaccg 

gcgaggacatcgtgagcgtgagcgacgcccccaagagctgcgtgaccgactgcgaggaggaggccggcaccgagagccagcagggcaccgagagcag 

ccactgcaagtacgtggccgagagcgtgatggcccagagcacccagaacgtggactacagccagctgcaggagatcatctaccccgagagcagcaag 

ctgggcgagggcggccccgagagcctgggccccagcgagcccaagccccgcagccccagcacccccccccccgtggtgcagatgcccgtgaccctgc 

agccccagacccaggtgcgccaggcccagaccccccgcgagaaccaggtggagcgcgaccgcgtgagcatccccgccatgcccacccagatccagta 

cccccagtaccagcccgtggagaacaagacccagcccctggtggtgtaccagtacggcctgcccaccgagctgcagtaccgcccccccagcgaggtg 

cagtaccgcccccaggccgtgtgccccgtgcccaacagcacxgccccctaccagcagcccaccgccatggccagcaacagccccgccacccaggacg 

ccgccctgtacccccagccccccaccgtgcgcctgaaccccaccgccagccgcagcggccagggcggcgccctgcacgccgtgatcgacgaggcccg 

caagcagggcgacctggaggcctggcgcttcctggtgatcctgcagctggtgcaggccggcgaggagaccciggtgggcgcccccgcccgcgccgag 

acccgctgcgagcccttcaccatgaagatgctgaaggacatcaaggagggcgtgaagcagtacggcagcaacagcccctacatccgcaccctgctgg 

acagcatcgcccacggcaaccgcctgaccccctacgactgggagatcctggccaagagcagcctgagcagcagccagtacctgcagttcaagacctg 

gtggatcgacggcgtgcaggagcaggtgcgcaagaaccaggccaccaagcccaccgtgaacatcgacgccgaccagctgctgggcaccggccccaac 

tggagcaccatcaaccagcagagcgtgatgcagaacgaggccatcgagcaggtgcgcgccatctgcctgcgcgcctggggcaagatccaggaccccg 

gcaccgccttccccatcaacagcatccgccagggcagcaaggagccctaccccgacttcgtggcccgcctgcaggacgccgcccagaagagcatcac 

cgacgacaacgcccgcaaggtgatcgtggagctgatggcctacgagaacgccaaccccgagtgccagagcgccatcaagcccctgaagggcaaggtg 

cccgccggcgtggacgtgatcaccgagtacgtgaaggcctgcgacggcatcggcggcgccatgcacaaggccatgctgatggcccaggccatgcgcg 

gcctgaccctgggcggdcaggtgcgcaccttcggcaagaagtgctacaactgcggccagatcggccacctgaagcgcagctgccccgtgctgaacaa 

gcagaacatcatcaaccaggccatcaccgccaagaacaagaagcccagcggcctgtgccccaagtgcggcaagggcaagcactgggccaaccagtgc 

cacagcaagttcgacaaggacggccagcccctgagcggcaaccgcaagcgcggccagccccaggccccccagcagaccggcgccttccccgtgcagc 

tgttcgtgccccagggcttccagggccagcagcccctgcagaagatcccccccctgcagggcgtgagccagctgcagcagagcaacagctgccccgc 

ScccagSgS 

AGTCTCGACTCTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTC 

TAATAAMTGAGGAMTTGCATCGCATTGTCTGAGTAGGTGrcATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCMGGGGGAGGATTGGGAAG 

ACAATAGCAGGGGGGTGGGCGAAGAACTCCAGCATGAGATCCCCGCGCTGGAGGATCATCCAGCCGGCGTCCCGGAARACGATTCCGAAGCCCAACC 

TTTCATAGMGGCGGCGGTGGAArCGAMTCTCGTGATGGCAGGTTGGGCGTCGCTTGGTCGGTCATTTCGAACCCCAGAGTCCCGCTCAGAAGAAC 

TCGTCMGAAGGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAAAGCACGAGGAAGCGGTCAGCCCATTCGCCGCC 

CAGCAATATCACGGGTAGCCMCGCTATGTCCTGATAGCGGTCCGCCAC^CCCAGCCGGCCACAGTCGATGAATCCAGAAAAGCGGCCAT^TCCAC 

CATGATATCCGGCAAGCAGGCATCGCCATGGGTCACGACGA 

AGCCCCTGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTCCATCCGAGTACGTGCTCGCTCGATGCGATGTTTCGCTTGGTGGTCGA 
ATGGGCAGGTAGCCGGATCAAGCGTATGCAGCCGCCGCATTGCATCAGCCATGA^ 

CTGCCCCGGCACTTCGCCCMTAGCAGCCAGTCCCTTCCCGCTTCAGTGACMCGTCGAGCACAGCTGCGCAAGGAAC 

GATAGCCGCGCTGCCTCGTCCTGCAGTTCATTCAGGGCACCGGACAGGTCGGTCTTGACAAAAAGAACCGGGCGCCCCTGCGCTGACAGCCGGAACA 
CGGCGGCATCAGAGCAGCCGATTGTCTGTTGTGCCWGTCATAGCCGAATAGCCTCTCCACCCAAGCGGCCGGAGMCCTGCGTGCAATCCATCTTG 
TTCMTCATGCGAAACGATCCTCATCCTGTCTCTTGATCAGATCTTGATCCCCTGCGCCATCAGATCCTTGGCGGCAAGAAAGCCATCCAGTTTACT 
TTGCAGGGCTTCCCAACCTTACCAGAGGGCGCCCCAGCTIGGCAATTCCGGTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAGCTATCGCCATGTAA 
GCCCACTGCAAGCTACCTGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGTCCAGATAGCCCAGTAGCTGACATTCATCCGGGGTCAGCACCGTTTCTG 
CGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTGMGCTAATTCAIGGTTAAATTTTTGTTAA 

ATCAGCTCATTTTTTMCCMTAGGCCGAAATCGGCAAAATCCCTTATAAATC^ 

ACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCGGATCAGCTTATGCGGTGTGAAATA 
CCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGG 
TATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAMGGCCAGCA 
CCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCG 

ACAGGACTATAAAGATACCAGGCGnTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTOTTCC<aCCa 
TCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAG 

CCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGT 
AACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCT 
GCGCTCTGCTGMGCCAGTTACCTTCGGAAAMGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAA 
GCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACTGAACGGTGATCCCCACCGGAATTGCG 

AIGAACCCAAGCGAGATGCAMGAAAAGCACCTCCGCGGAGACGGAGACATC^ 

CAGAAGAACAGATGAAGTTGCCATCCACCAAGAAGGCAGAGCCGCCAACTTGGGCACAACTAAAGAAGCTGACGCAGTTAGCTACAAAATATCTAGA 
GAACACAAAGGTGACACAAACCCCAGAGAGTATGCTGCTTGCAGCCTTGATGATTGTATCAATGGTGGTAAGTCTCCCTATGCCTGCAGGAGCAGCT 
GCAGCTAACTATACCTACTGGGCCTATGTGCCTTTCCCGCCCTTAATTCGGGCAGTCACATGGATGGATAATCCTACAGAAGTATATGTTAATGATA 
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GTGTATGGGTACCTGGCCCCATAGATGATCGCTX^CCTGCCAAACCT^ 
TATTTGCCTAGGGAGAGCACCAGGATGTTTAATGCCTGCAGTCCAAAAT^ 
CACAT(&TAAGCGGGATGTCACTCAG(£^ 
GCCCCAAGGAAATTCCCAMGMTCAAAAAATAC^^ 

CGGMCTATTATAGATTGGGCACCTCGAGGTCMTTCTACCACMTTGCTCAGGACAAACTCAGTCGTGTCCAAGTGCACAAGTGAGTCCAGCTGTT 

GATAGCGACTTMCAGAAAGTTTAGACAAACATMGCATAAAAMTTGC^^ 

GACCAAAMTAGTAAGTCCTGTTTCTGGTCCTGAACATCCAGM^ 

AACTTTAGAAACMGAGATCGTMGCCATTTTATACTATO^ 

CTAGTTGTAGGAMTATAGTTATTAAACCAGACTCCCAGACTA^ 

AACACCGTATTCTGCTGGTGAGAGCMGAGAGG(£GTGTGGAT^ 

TGAAGTATTAAAAGGTGTTTTAAATAGATCCAAAAGATTCATTTTTACTTTAATTGCAGTGATTATGGGATTAATTGCAGTCACAGCTACGGCTGCT 

GTAGCAGGAGTTGCATTGCACTCTTCTGTO^ * 

TTGATCAAAAAnGGO\AATCAAATTMTGATCTTAGACA^ 

ATGTGACTGGMTACGTCAGATTTTTGTATTACACCCCAAATTT^ 

GMGATMTCTCACTTTAGACATTTCCAMTTAAMGAACAMT^ 

CAGGAGTTGCTGATGGCCTCGCAAATCTTMCCCTGTCACTTGGGT^ 

CCTGTTTTGTCTGTTGTTAGTCTGCAGGTGTACCC^ 

AAAAGAAAAGGGGGAAATGTGGGGAAAAGCAAGAGAGATCAGATTGTTACTGTGTCTGTGGCCTAA 
SEQ ID 82 

ATGMCCCCAGCGAGATGCAGCGCMGGCCCCCtt^ 
GCGAGGAGCAGATGMGCT(X:CCAGCACCMGAAGGCCGA^ 
; - GMCACCMGGTGACCCAGACCCCCGAGAGCATGCTGCTC 
GCCGCC^CTACACCTACTGGGCCTACCTGCC^^ 
GCGTGTGGGTGCCCGGCCCCATCGACGACCGCTGCCCCGCCAAGC 

CATCTGCCTGGGCCGCGCCCCCGGCTGCCTGATGCCCGCCGTGCAGAACTGGCTGGTGGAGGTGCCCACCGTGAGCCCCATCTGCCGCTTCACCTAC 
CACATGGTGAGCGGCATGAGCCTGCGCCCCCGC^^ 

GCCCCAAGGAGATCCCCAAGGAGAGCMGAACACCGAGGTGCTGGTGTGGGAGGAGTGC^ 
CGGCACCATCATC(^CTGGGCCCCCCGCGGtt^ 

GACAGCGACCTGACCGAGAGGCTGGACMGCACMGCACMGAAGCTGCAG^ 

GCCCCAAGATCGTGAGCCCCGTGAGCGGCCCCGAGCACCCCGAGCTGTGGCGCCTGACCGTGGCCAGCCACCACATCCGCATCTGGAGCGGCAACCA 
GACCCTGGAGACCCGCGACCGCAAGCCCTTCTACACCATCGACCTGAACAGCAGCCTGACCGTGCCCCTGCAGAGCTGCGTGAAGCCCCCCTACATG 
CTGGTGGTGGGCAACATCGTGATCAAGCCCGACAGCCAGACCATCACCTGCGAGAACTGCCGCCTGCTGACCTGCATCGACAGCACCTTCAACTGGC 
AGCACCGCATCCTGCTGGTGCGCGCCCGCGAGGGCGTGTGGATCCCCGTGAGCATGGACCGCCCCTGGGAGGCCAGCCCCAGCGTGCACATCCTGAC 
CGAGGTrcTGAAC^CGTGCTC^CCGCAGCMG 

GTGGCCGGCGTGGCCCTGCACAGCAGCGTGCAGAGCGTGAACTTCGTGAACGACTGGCAGAAGAACAGCACCCGCCTGTGGAACAGCCAGAGCAGCA 
TCGACCAGAAGCTGGCCAACCAGATCAACGACCTGCGCCAGACCGTGATCTGGATGGGCGACCGCCTGATGAGCCTGGAGCACCGCTTCCAGCTGCA 
GTGCGACTGGAACACCAGCGACTTCTGCATCACCCCCCAGATCTACAACGAGAGCGAGCACCACTGGGACATGGTGCGCCGCCACCTGCAGGGCCGC 
GAGGACAACCTGACCCTGGACATCAGCAAGCTGAAGGAGCAGATCTTCGAGGCCAGCAAGGCCCACCTGAACCTGGTGCCCGGCACCGAGGCCATCG 
CCGGCGTGGCCGACGGCCTGGCCAACCTGAACCCCGTGACCTGGGTGAAGACCATCGGCAGCACCACCATCATCAACCTGATCCTGATCCTGGTGTG 
CCTGTTCTGCCTGCTGCTGGTGTGCCGCTGCACCCAGCAGCTGCGCCGCGACAGCGACCACCGCGAGCGCGCCATGATGACCATGGCCGTGCTGAGC 
AAGCGCAAGGGCGGCAACGTGGGCAAGAGCAAGCGCGACCAGATCGTGACCGTGAGCGTGGCCTAA 

SEQ ID 83 

. MNPSEMQRKAPPRRRRHRNRAPLTHKMNRMVTSEEQMKLPSTKK^^ 
AANYTYWAWPFPPLIRAVTWMDNPTEVYVNDSW 

HMVSGMSLRPRVNYLQDFSYQRSLKFRPKGKPCPKEIPKESKNTEVLVWEECVANSAVILQNNEFGTIIDWAPRGQFYHNCSGQTQSCPSAQVSPAV 

DSDtTESLDKHKHKKLQSFYPWEflGEKGISTPRPKIVSPVSGPEHPELWRLTVASHHIRIWSGNQTLETRDRKPFYTIDLNSSLTVPLQSCVKPPYM 

LWGNIVIKPDSQTITCENCRLLTCIDSTFNWQHRILLVRAREGW 

VAGVALHSSVQSVNFVNDWQKNSTRLWNSQSSIDQKLANQIN^ 

EDNLTLDISKLKEQIFEASKAHI^LVPGTEAIAGVADGLANLNPVTWVKTIGSTTlINLILILVCLFCLLLVCRCTQQLRRDSDHi^ 
KRKGGN VGKS KRDQI VTVSVA 
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